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A  Theoretical  Development  of  a 
Multicarrier  Wireless  Relay 

System 
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Chapter  1 


Introduction 


A  three- node  relay  system  as  shown  in  Fig.  1.1  has  attracted  much  attention 
in  the  literature.  While  the  capacity  of  such  a  system  remains  an  open  problem,  various 
attempts  have  been  focused  on  specific  relay  schemes  and  their  achievable  regions.  One 
of  the  most  recent  works  is  in  [1]  where  the  authors  developed  algorithms  to  maximize 
the  relay  system  throughput  by  assuming  that  the  relay  node  is  non-regenerative  and  the 
powers  at  the  source  node  and  the  relay  node  can  be  controlled  over  multiple  subcarriers. 
Specifically,  the  relay  communication  in  [1]  is  achieved  in  two  phases.  In  the  first  phase, 
source  node  transmits  signal  to  relay.  In  the  second  phase,  relay  amplifies  and  transmits  the 
signal  received  from  the  first  phase  to  the  destination  node.  We  call  the  first  phase  source 
phase  and  the  second  phase  relay  phase.  In  [1],  there  is  no  direct  link  between  the  source 
node  and  the  destination  node.  As  a  result,  the  source  node  is  always  silent  during  the 
relay  phase.  With  the  presence  of  direct  link  between,  the  source  node  has  an  additional 
freedom  to  distribute  its  power  among  the  two  phases,  instead  of  only  transmitting  in  the 


2 


source  phase.  [2]  presents  a  power  allocation  algorithm  to  minimize  a  general  cost  function 
for  the  three-node  relay  systems  with  direct  link.  Moreover,  we  allow  the  source  to  transmit 
in  both  source  and  relay  phase.  The  following  conten  has  previously  published  in  [2]. 


Figure  1.1:  A  multicarrier  three-node  relay  system,  where  k  =  0, 1,  •  •  •  ,  N  —  1  is  the  index 
of  subcarriers,  and  the  source  transmits  in  both  phases  1  and  2.  S,  R  and  D  in  the  figures 
stands  for  source  node,  relay  node  and  destination  node,  respectively 

The  effect  of  the  direct  link  has  also  been  considered  in  [3,  4,  5,  6,  7,  8,  9].  But 
in  all  these  works,  the  source  node  and  the  relay  node  each  transmit  only  in  one  of  two 
separate  time  slots1  for  each  carrier.  In  fact,  except  for  [9],  all  other  works  mentioned  above 
only  allow  the  source  to  transmit  in  the  source  phase.  Many  other  prior  works  such  as 
[10,  11,  12,  13]  do  not  consider  the  direct  link.  More  recent  works  on  relays  can  be  found 
in  [14], 

In  this  paper,  we  let  the  source  repeat  a  transmission  of  the  same  information 
(transmitted  in  the  source  phase)  to  the  destination  in  the  relay  phase.  In  other  words, 
we  split  the  total  source  power  into  two  slots  for  transmitting  the  same  information.  This 
additional  freedom  makes  the  required  power  allocation  algorithms  differ  from  those  pub¬ 
lished  before.  The  achieved  capacity  is  higher  than  the  case  where  the  source  power  is  only 
limited  to  one  of  two  time  slots  for  each  carrier,  which  is  a  special  case  of  our  relay  scheme. 
However,  this  new  scheme  introduces  several  technical  challenges  as  elaborated  below. 

•  More  Complicated  Non-convex  Optimization  Objective:  Since  source  node 
1We  use  “phase”  and  “slot”  interchangeably. 
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can  transmits  in  both  phases,  we  need  to  consider  the  joint  optimization  of  source 


power  allocation  at  both  phases  and  relay  power  allocation  at  the  relay  phase.  This 
not  only  increases  the  number  of  optimizatin  variables  but  also  makes  the  optimization 
objective  a  more  complicated  non-convex  function. 

•  More  Complex  Coupling  between  Source  and  Relay  Power  Allocation:  The 

source  and  relay  power  optimization  is  coupled  in  a  more  complex  way  as  the  desti¬ 
nation  node  gets  signal  from  both  source  and  relay  nodes  during  the  relay  phase. 

From  an  application  point  of  view,  the  three-node  multicarrier  relay  system  shown  in  Fig. 
1.1  can  be  a  simple  abstraction  of  a  multiuser  multicarrier  downlink  relay  system  shown 
in  Fig.  1.2  where  the  destination  nodes  can  select  different  subcarriers  in  each  phase.  In 
the  multiuser  case,  the  channel  gains  of  a  link  among  different  subcarriers  are  generally 
diverse,  and  hence  the  power  allocations  at  both  the  source  and  relay  nodes  over  all  sub¬ 
carriers  are  particularly  important.  The  proposed  algorithm  can  also  be  used  to  solve  the 
power  allocation  problems  in  the  multiuser  multicarrier  relay  systems  for  given  subcarrier 
allocation. 
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Figure  1.2:  A  multiuser  application  of  the  relay  system  shown  in  Fig.  1.1  where  the  des¬ 
tination  nodes  (users)  can  select  different  subcarriers  embedded  in  OFDM  frames  in  each 
phase. 
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Several  of  the  previous  works  such  as  [3,  4,  5,  6,  7,  8]  considered  MIMO  relay 
systems.  The  work  in  this  paper  is  limited  to  single- antenna  nodes.  While  our  contributions 
do  not  entirely  generalize  the  previous  works  on  MIMO  relay  systems,  the  insight  shown  in 
this  paper  could  help  achieve  that  goal  in  the  future. 

In  the  next  section,  we  present  in  detail  the  system  model  and  the  problem  for¬ 
mulation.  The  problem  formulated  is  a  joint  optimization  of  source  power  allocation  over 
two  phases  and  relay  power  allocation  in  the  second  phase.  Two  types  of  cost  functions 
are  considered.  Using  the  alternating  optimization  (AO)  method[12],  the  joint  optimization 
problem  is  decomposed  into  two  subproblems,  namely  the  optimization  of  relay  power  allo¬ 
cation  for  given  source  power  allocation,  and  the  optimization  of  source  power  allocation  for 
given  relay  power  allocation.  These  two  subproblems  are  respectively  addressed  in  Section 
3  and  Section  4.  Then  the  overall  AO  algorithm  and  the  optimality  analysis  is  given  in 
Section  5.  Although  the  exact  solution  to  the  joint  optimization  problem  is  not  guaranteed, 
the  AO  algorithm  converges  to  a  stationary  point  which  is  asymptotically  optimal  for  large 
relay  transmit  power  or  large  source-relay  channel  gain.  Simulation  examples  are  given  in 
Section  6  to  illustrate  the  superior  performance  of  the  proposed  AO  algorithm. 
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Chapter  2 


System  Model  and  Problem 
Formulation 


The  relay  architecture  is  shown  in  Fig.  1.1.  We  consider  an  OFDM  based  multi¬ 
carrier  system.  Each  packet  of  information  is  encoded  into  N  independent  complex  symbols, 
X) fc,  k  =  0, 1,  •  •  •  ,  N  —  1,  of  zero  mean  and  unit  variance.  To  transmit  a  packet  of  information 
from  the  source  to  the  destination,  the  relay  scheme  has  two  phases. 

In  source  phase,  the  source  transmits  =  ^Pi,k%k,  k  =  0, 1,  •  •  •  ,1V  —  1,  over  N 
subcarriers  towards  the  destination  and  the  relay,  the  relay  does  not  transmit  but  receives 
Vr,ki  k  =  0, 1,  •  •  •  ,  N  —  1,  and  the  destination  receives  y(i\,k-  k  =  0, 1,  •  •  •  ,  N  —  1.  The  power 
of  xhk  is  p1:k. 

In  relay  phase,  the  source  transmits  again  but  it  transmits  x 2,k  =  \/P2,kXk>  k  = 
0, 1,  •  •  •  ,  N  —  1,  the  relay  does  not  receive  but,  concurrently  with  the  source,  transmits 
xr,k  =  9kVr,k j  k  =  0, 1,  *  ■  ■  ,  N  —  1,  and  the  destination  receives  y<i2,k ,  k  =  0, 1,  ■  ■  ■  ,  N  —  1. 
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The  relay  factor  has  the  amplitude  gk  and  the  phase  /3k-  The  power  of  X2,k  is  P2,k- 

The  channel  coefficient  of  the  fc-th  subcarrier  from  the  source  to  the  destination 
is  denoted  by  hi,k,  that  from  the  source  to  the  relay  is  fi2,k,  and  that  from  the  relay  to  the 
destination  is  h 3^.  We  use  h^k  as  the  amplitude  of  h^k,  and  the  phase  of  For 
clarity,  we  assume  that  all  the  channel  coefficients  are  non-zero.  The  algorithms  developed 
in  this  paper  can  be  easily  extended  to  solve  the  cases  when  some  of  the  channel  coefficients 
are  zero. 

The  N  subcarriers  used  in  source  phase  for  x\ ^  are  indexed  by  k  =  0, 1,  •  •  •  ,N  —  1. 
But  the  N  subcarriers  used  in  relay  phase  for  X2,k  (from  source  to  destination)  and  xT)k 
(from  relay  to  destination)  are  indexed  by  k  =  0, 1,  •  •  •  ,  N  —  1,  which  can  be  a  permutation 
of  those  used  in  source  phase.  Namely,  k  is  a  function  of  k  via  a  given  permutation. 
Permuation  can  be  used  to  model  the  subcarrier  allocation  in  multi-user  relay  systems, 
where  a  user  may  be  allocated  with  a  different  subcarrier  in  the  relay  phase. 

Now,  we  can  write  that  in  source  phase,  the  destination  and  the  relay  receive, 
respectively, 

Vdl,k  =  hl,k%l,k  T  fldl,k  (2.1) 

Vr,k  =  h2,k%l,k  T  f^r,k  (2-2) 

and  in  relay  phase,  the  destination  receives 

Vd2,k  =  h3j.Xrjk  +  h1j,X2,k  +  (2-3) 

where  k  in  h3  j.  and  h1  j,  of  (2.3)  is  a  permuted  version  of  k  in  hi^  and  h,2tk  °f  (2.1)  and 
(2.2).  All  the  noise  terms  n^i^,  hr^k  and  hd2,k  are  assumed  to  be  independent  of  each  other 
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and  across  subcarriers,  and  have  zero  mean  and  unit  variance.  Note  that  there  is  no  loss 
of  generality  to  assume  the  unit  variance.  If  the  variance  of  h(n.k  and  hd2,k  is  7 d,  we  can 
divide  (2.1)  and  (2.3)  by  y/yd  and  adjust  the  notations.  And  if  the  variance  of  nr)k  is  yr, 
we  can  divide  (2.2)  by  y/qy  and  adjust  the  notations. 

Combining  all  the  previous  equations,  we  can  write  the  received  signals  in  both 
phases  as  a  vector: 

y  k  =  iifcXfc  +  nk  (2.4) 


with  yk  =  [  ydl^  yd2  k  ]T  and 

hfc  =  [  y/Pi^hitk,  y/pi^gkh2,kh3,k  +  \/P^kK,k  ^  (2'5) 

nfc  =  [  hdi,k,  gkhz)knr)k  +  hd2  k  }T  (2-6) 

Since  the  covariance  matrix  of  the  noise  vector  hk  is  Ck  =  diag  ^1,  g\ k  +  1^ ,  a  sufficient 
statistics  for  xk  is 

xk  =  hkCkljyk  (2.7) 


One  can  then  verify  that  the  SNR  of  xk  is 

\y/Pi,kgkh2,kh3k  +  ^/P-2,khlk\2 


SNRfc  —  Pi^khik  + 


9lhl,k  +  1 


(2.8) 


It  follows  from  (2.8)  that  /3k  =  alk  —  a 2,k  ~  a 3  k  is  the  phase  of  gk  that  maximizes 
SNRfc.  With  the  optimal  phase  (3k,  (2.8)  becomes 


OArD  ,2  ,  {y/Ppk9kh2,khzk  +  y/p%khhk)2 

SNRfc  =  p\,kh%k  H - 


(2.9) 


9khl,k  +  1 

Let  source  node  transmit  in  relay  phase  will  result  in  a  better  SNR  especially  when 


relay  power  is  limited  or  direct  channel  is  strong.  The  simulation  results  also  show  that  the 


optimized  P2,fc’s  are  typically  larger  than  zero.  A  more  rigorous  discussion  will  be  given  at 
the  end  of  Section  4. 

We  will  consider  two  types  of  cost  functions.  The  first  is 


J\  =  —  min  SNRfc. 

k 


(2.10) 


This  cost  could  be  particularly  useful  if  each  of  the  N  subcarriers  is  occupied  by  a  distinct 
downlink  user  (see  Fig.  1.2)  and  we  want  to  ensure  a  fair  quality  for  all  users.  The  second 
cost  is 

N—\ 

J2  =  E  /fc(SNRfc)>  (2-H) 


k= 0 


where  /&( SNR*.)  is  any  decreasing  convex  function  of  SNR*.  >  0,  i.e.,  f'k  = 
and  fi.  =  r>  l ^  I  >  0.  A  special  form  of  J-2  is 


i=SNRfc 


<  0 


x=SNKk 


N—l 


J2  =  -^E1°g2(1  +  SNRfc), 


(2.12) 


k= 0 


which  is  the  negative  of  the  capacity  of  the  relay  system  under  the  previously  defined  relay 
scheme.  Clearly,  there  are  many  more  variations  of  the  cost  functions.  But  we  hope  that 
our  results  for  J\  and  J2  can  help  solve  other  related  problems. 

The  joint  source  and  relay  power  allocation  problem  is  formulated  as: 


min  J.  (2-13) 

Pl,k,P2,k,9k  jVfc 

subject  to 

N—l 

y  (pi,k + P2,k)  <  ps,  (2-i4) 

k= 0 
N-l 

E  {Skp2,kPl,k  +  9k)  ^  Pr ■  (2-15) 

k= 0 
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along  with  g &  >  0,  p\.k  >  0  and  p2,k  >  0.  Here,  J  is  either  J\  or  J2,  Pr  is  the  power 
constraint  at  the  relay,  and  Ps  is  the  sum  power  constraint  at  the  source. 

The  above  problem  differs  from  those  in  [4,  5,  6,  7,  8]  in  a  fundamental  way.  As 
mentioned  earlier,  all  those  references  assume  that  the  source  node  is  silent  during  the  relay 
phase.  As  shown  later  in  this  paper,  by  allowing  the  source  power  to  be  distributed  in  both 
phases,  a  substantial  performance  gain  can  be  achieved. 

The  exact  solution  to  (2.13)  is  difficult  to  find  because  both  the  cost  function 
and  the  relay  power  constraint  in  (2.15)  are  non-convex.  In  this  paper,  we  provide  a  local 
optimal  solution  by  adoptting  an  alternating  optimization  approach  where  we  optimize  gk 
\/k  with  previously  given  p\yk  and  p2,fc  Vfc,  then  optimize  p\,k  and  p2,fc  V7c  with  previously 
given  gk  Vfc,  and  then  repeat  the  process  until  convergence.  For  fixed  source  power  allocation 
Pi,k,P2,k^k,  the  relay  power  allocation  problem  (i.e.,  the  optimization  of  g^k)  is  still  non- 
convex.  However,  by  exploiting  the  specific  structure  of  the  SNR  expression  in  (2.9),  we 
propose  efficient  algorithms  to  find  the  unique  optimal  solution  for  this  non-covex  problem 
in  Section  3.  For  fixed  relay  amplifying  factor  g^,  the  dual-phase  source  power  allocation 
problem  (i.e.,  the  optimization  of  PikiP2k^k)  is  convex  and  we  propose  efficient  algorithms 
to  find  the  optimal  solution  in  Section  4.  The  overall  algorithm  to  solve  Problem  (2.13)  is 
given  in  Section  5. 
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Chapter  3 


Optimal  Relay  Power  Allocation 
given  Source  Power  Allocation 


For  fixed  p\yk  and  P2,ki  Problem  (2.13)  becomes 


min  J,  (3.1) 

9fc>0,Vfc 


subject  to 


where  cu  =  h\kpi,k  +  1. 


N—\ 


'y  1  ckSk  Pr- 

k= 0 


(3.2) 


Neither  J±  nor  J-2  is  a  convex  functions  of  gk  V&.  In  particular,  the  SNR^  in  (2.9) 
is  not  a  concave  function  of  gk-  However,  it  is  easy  to  verify  that 


dSNRfc 

dgk 


2{y/Phk9kh2,kh3'k  +  VP^khhk) 
(y/P^h2,kh3,k  \fP‘2-k9khl  kh23  l 

(9khlk  +  !)2 


(3.3) 
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It  follows  that  SNR,/,,  is  a  quasi-concave  function  of  g/,  which  has  a  unique  maximum  at1 


9k  ~ 


h2,ky/Pl,k 

hl,kh3, ky/PZ’k 


(3.4) 


Namely,  SNR/  is  an  increasing  function  of  gk  for  g /  £  [0,  gk)  and  a  decreasing  function  of  gk 
for  gk  £  (g),  oo).  Because  of  the  power  constraint  (3.2),  we  only  need  to  search  the  optimal 
gk  within  [0,  </£].  A  typical  plot  of  SNR,/,,  is  shown  in  Figure.  3.1. 


Figure  3.1:  A  typical  plot  of  SNR,/,  vs.  gk  where  g*k  is  the  position  of  the  peak  value  SNR,),. 


Note  that  if  the  source  does  not  transmit  in  relay  phase,  then  =  0  and  hence 
SNR/,  is  simply  an  increasing  concave  function  of  gk  (and  J\  and  J-2  are  convex  functions 
of  gk  Vk). 


3.1  Using  the  Cost  J  =  J\ 

Recall  J\  =  —  min/  SNR/  and  minrgfcvfc}  J\  =  max{9;v/}  min/  SNR/.  Denote  SNR),  = 

SNR/|9fc=9*.  Also  denote  the  solution  to  (3.1)  by  g\z  and  the  corresponding  optimal  SNR 
xNote  that  if  =  0,  we  let  gl  =  oo. 
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by  SNRfc  =  SNRfc|9fc=gfe.  From  (2.9),  if  we  let  =  0,  we  have  the  corresponding  SNR&  = 
Pi,khf  k+p2,khg  i-  Since  SNRfc  is  a  monotonic  increasing  function  of  gk  over  [0,gk),  we  must 
have: 

Pi,khlk  +  p2,khl~k  <  SNR  *  <  SNR^,  Vfc.  (3.5) 

Also,  because  SNR*,  is  an  increasing  function  of  gk  for  gk  6  [0,^),  it  follows  that 

SNRfc  =  min  SNR;,  VA:  €  jo  <  k  <  N  —  1  :  gk>  >  o|  . 

It  is  easy  to  verify  from  (2.9)  that  for  each  g  >  0,  the  equation  SNR^  =  g  can  be 
reduced  to  a  quadratic  equation  of  gk,  which  has  two  (closed- form)  solutions  gk 1  and  g^\ 
Furthermore,  if  0  <  g  <  SNR|, ,  one  (and  only  one  value)  of  g^  and  g^  is  less  than  or 
equal  to  gk,  which  is  denoted  by  gk(g)- 

With  the  above  discussions,  one  can  verify  that  the  following  algorithm  yields  the 
exact  solution  to  (3.1)  with  J  =  Jg. 

Algorithm  1: 

1.  Compute  gk,  SNRjf,  and  g*  =  min*,  SNR£. 

2.  For  each  k,  solve  the  equation  SNR&  =  g*  to  obtain  gk{g*)-  If  (3.2)  is  satisfied  with 
gk  =  max(0 ,gk{g*))  VA;,  then  set  gk  =  max(0 ,gk(p*))  VA;  and  stop.  Otherwise,  go  to 
the  next  step. 

3.  Use  the  bisection  search  method2  [15]  to  find  g  within  (0 ,g*)  such  that  (3.2)  is 
satisfied  with  equality  as  Ylk=o  ck9k  =  F r  and  9k  =  max(0,  gk{g))  VA;.  Then,  set 
gk  =  max(0 ,gk(g))  VA:,  and  stop. 

2It  converges  exponentially  fast. 
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3.2  Using  the  Cost  J  =  J2 


Recall  J2  =  Ufc /fc(SNRfc)  where  fk(x)  is  a  decreasing  convex  function  of  x  >  0. 

To  find  the  solution  to  (3.1),  the  KKT  conditions  [15]  are  the  necessary  conditions  (but  not 
sufficient  conditions  due  to  non-convexity  of  J2  with  respect  to  g k  V/c).  But  we  can  still 
distill  the  exact  solution  from  the  KKT  conditions  as  follows.  Among  the  KKT  conditions3, 
we  have 

Jk  dgk  +  2ll9kCk  -  Ik  =  0,  V/c,  (3.6) 

where  f'k  =  <  0  and  dS^k  is  given  in  (3.3),  /j  is  the  Lagrange  multiplier  associated 

with  the  relay  power  constraint  in  (3.2),  and  7 k  is  the  Lagrange  multiplier  for  the  constraint 
9k  >  0.  Also,  //  >  0  if  the  equality  in  (3.2)  holds,  and  7^  =  0  if  gk  >  0. 

The  following  proposition  characterizes  the  properties  of  the  solution  of  the  KKT 
conditions. 

Proposition  1  The  KKT  conditions  of  the  relay  power  allocation  problem  in  (3.1)  with 
J  =  J2  has  a  unique  solution  denoted  by  fi,  {7 k,gk  '■  V/c}.  Moreover,  fi,  {7 klgk  '■  V/c}  has  the 
following  properties: 

1-  9k  =  0,7 k  =  0,  V/c  G  /C0  =  j/c'  :  pl  k>  =  0  j  and  gk  >  0,7fc  =  0,  V/c  G  /C0  — 

{*'  :  Pi,k'  >  °}- 

2.  IfJ2k=d  ck9k2  —  U;  we  have  gk  =  gk ,  V/c;  otherwise  gk  =  0,  V/c  G  /Co  and  p,,  { gk  :  V/c  G  /Co} 

*sFor  brevity,  we  do  not  list  all  KKT  conditions.  We  assume  that  the  reader  understands  the  basic 
components  of  the  KKT  conditions. 
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is  the  unique  solution  of  the  following  equations: 


j  riSNRfc 

k  2 gkdgk 

J2  Ckgk 

keic  o 


v/c  G  /C 0; 
Pr. 


3.  In  (3.7),  —f'k2^dgkk  Vositive  and  decreasing  with  g k  G  [0,  g)*]. 


(3.7) 

(3.8) 


Lei  gk(^),yk  G  /Co  denote  the  solution  of  (3.7)  with  fixed  /a.  Then  we  have  gk{g)  G 
[0,g£,]  and  YlkeK0  ck9k (m)  *s  decreasing  with  g. 


Please  refer  to  Appendix  8.1  for  the  proof. 

By  Result  3)  in  Proposition  1,  for  a  given  feasible  //,  the  solution  gk{g),'ik  G  /Co 
of  (3.7)  can  be  easily  found  by  a  bisection  search.  On  the  other  hand,  if  J2k=o  ck9k2  >  -Pr 
the  optimal  Lagrange  multiplier  g  is  given  by  the  solution  of 


ck9k(h)  =  Pr,  (3.9) 

keic  o 

which  can  also  be  found  by  a  bisection  search  according  to  the  Result  4)  in  Proposition  1. 

Based  on  Proposition  1,  one  can  verify  that  the  following  algorithm  yields  the 
exact  solution  to  Problem  (3.1)  with  J  =  J2: 

Algorithm  2: 


1.  Compute  gl  \/k.  If  (3.2)  holds  with  gk  =  g k,  set  (jk  =  g*k  Vfc  and  stop.  Otherwise,  go 
to  the  next  step. 


2.  Use  a  bisection  search  to  find  fi  within  [0,  00]  such  that  Equation  (3.9)  is  satisfied, 
where  for  a  given  g,  the  gk{k),\/k  G  /Co  in  (3.9)  is  found  as  the  solution  to  (3.7)  via 
an  inner-loop  bisection  search. 


15 


Chapter  4 

Optimal  Source  Power  Allocation 
given  Relay  Power  Allocation 

Given  a  relay  power  allocation  (i.e.,  given  a  set  of  (jk  Vfc),  we  now  consider  the 
optimization  of  the  source  power  allocation  by  solving  the  following  problem: 

min  J,  (4.1) 

Pi,fc>0,P2,fc>0,Vfc 

subject  to  (2.14)  and  (2.15)  ,  which  is  later  proved  convex. 

4.1  Using  the  Cost  J  =  J\ 

Since  J\  =  —  min^SNRfc,  Problem  (4.1)  can  be  reformulated  as 

min  — 6 ,  (4.2) 

0,Pi,k>O,p2,fc>O,  Vfc 

subject  to  (2.14),  (2.15)  and 

SNR  k>e,\/k.  (4.3) 
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It  follows  from  (2.9)  that 


SNRfc  T*kPl,k  “1“  ^ky/Pl,kP2,k  ^kP2,ki 


(4.4) 


where  =  /i?  .  +  9k,^f  ,  Sfe  =  gkhl^2^3’k  and  tk  =  ■>,  l'k .  . .  It  is  obvious  that  SNR/C  is 

increasing  with  p\^  and  P2,k- 

It  is  also  obvious  that  the  optimal  p\^  and  j>2,fc  must  be  such  that  the  equality  in 
(2.14)  is  satisfied.  If  p\tk  and  p2,fc  are  such  that  the  strict  inequality  in  (2.14)  holds,  we  can 
increase  some  or  all  of  j>2,fc  such  that  the  equality  in  (2.14)  is  achieved  and  6  is  increased 
while  (2.15)  is  not  affected. 

Furthermore,  the  optimal  p\^  and  p2,fc  must  be  such  that  SNR/,,  =  9 ,  Vfc.  If  there 
is  SNR;  satisfying  SNR/  >  min/j  SNR/,.,  we  can  reduce  pk)i  to  make  SNR /  =  ruin*.  SNR*, 
without  affecting  9  and  then  we  can  increase  some  or  all  of  p2,fc  such  that  the  equality  in 
(2.14)  is  achieved  and  9  is  increased  while  (2.15)  is  not  affected. 

Applying  the  KKT  conditions  [15]  to  the  problem  (4.2),  we  have 


x  ,  ,  sk^W^\ 

-A*  rk  +  -  - _ 


+  +  /q  —  0,  Vfc, 


-Afe  (  tk  +  )  +  hi  =  0,  V/c, 


(4.5) 

n  =—  I  1  r-x  (4.6) 

Z^P2,k 

where  A  &  is  the  fc-tli  multiplier  due  to  (4.3),  ji\  is  the  multiplier  due  to  (2.14),  and  [12  is  the 
multiplier  due  to  (2.15).  The  discussions  shown  previously  imply  that  /q  >  0  and  A k  >  0, 
Vfc. 

The  following  proposition  gives  the  solution  to  the  above  KKT  conditions  for  fixed 


Pl,P2- 
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Proposition  2  For  fixed,  Hi,H2,  the  solution  to  the  KKT  conditions  of  Problem  (4.2)  with 
J  =  J\  is  given  by 


Pi,k  =  < 


0, 

_e_ 

rk' 

e 


rk+ytk  ' 

e 


K  rk+VkSk+nJtf' 


tkl  -  rk  >  0,  sk  =  o 
4-7  -  rk  <  0,  sk  =  0 
47  -  rk  =  0,  sk  =  0 

Sk>  0 


P2,k  =  < 


47  ~  rk  >  0,  Sk  =  0 
0,  4-7  -  rk  <  0,  sk  =  0 

4-7  —  rk  =  0,  Sk  =  0 
VkPl,k,  sk>0 

where  7  =  g‘f.h'2  kj^  +  1/  Pk,  V&;  G  jfe  :  s*,  >  0  j  is  <?4en  by 


(4.7) 


(4.8) 


=  a/—  =  —  (4-7  -  rk  +  J (4-7  -  rfc)2  +  s|7  )  ,  (4.9) 

V  Pi,fe  &k  \  J 

and  0  is  chosen  such  that  (2.14)  is  satisfied  with  equality. 

Please  refer  to  Appendix  8.2  for  the  proof. 

By  Proposition  2,  for  fixed  /U-i ,  /l*2,  the  solution  of  the  KKT  conditions  only  depends 
on  the  intermediate  variable  7  =  gf. kj^  +  1.  Hence,  if  the  solution  in  Proposition  2  with 
7=1  (i.e.,  /1 2  =  0)  satisfies  the  relay  power  constraint  (2.15),  it  is  also  the  optimal  solution 
for  Problem  (4.2)  with  J  =  J\.  Otherwise,  we  can  use  bisection  search  to  find  the  7  such 
that  the  solution  in  Proposition  2  satisfies  (2.15)  with  equality  and  this  solution  is  optimal 
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for  Problem  (4.2).  Note  that  the  probability  for  Sk  =  ry.  =  tg  =  0  is  virtually  zero.  So,  the 
last  expressions  in  (4.7)  and  (4.8)  should  be  stable  numerically. 

From  the  above  analysis,  we  obtain  the  following  algorithm  which  finds  the  exact 
solution  to  (4.2)  with  .7  =  ,J\ : 

Algorithm  3: 

1.  Initialize  7  =  1  and  9  >  0. 

2.  Determine  rg-  from  (4.9)  for  all  k  where  Sk  >  0. 

3.  Determine  p\g-  and  P2 7.  by  (4.7)  and  (4.8). 

4.  One  step  bisection1:  If  the  left  side  of  (2.14)  is  smaller  than  its  right  side,  increase2 
9.  If  the  opposite  is  true,  decrease  9.  Go  to  step  3)  until  convergence. 

5.  If  7  =  1  and  (2.15)  is  satisfied,  stop.  Otherwise,  go  to  the  next  step. 

6.  One  step  bisection3:  If  the  left  side  of  (2.15)  is  larger  than  its  right  side,  increase  7. 
If  the  opposite  is  true,  decrease  7.  Go  to  step  2)  until  convergence. 


In  our  simulations,  “convergence”  means  that  the  difference  between  the  coefficients  ob¬ 
tained  in  two  consecutive  iterations  is  less  than  a  pre-set  threshold. 

1The  lower  bound  of  9  is  obviously  zero.  An  upper  bound  of  9  for  each  7  can  be  found  by  doubling  its 
previous  (nonzero)  value  each  time  initially  as  9  is  increased  consecutively  in  the  bisection  search. 

2In  bisection,  “increase  a  variable”  means  “increase  the  variable  halfway  towards  its  nearest  larger  es¬ 
timate  obtained  previously”,  and  “decrease  a  variable”  means  “decrease  the  variable  halfway  towards  its 
nearest  smaller  estimate  obtained  previously”. 

3 An  upper  bound  of  7  can  be  found  by  doubling  its  previous  (nonzero)  value  each  time  initially  as  7  is 
increased  consecutively  in  the  bisection  search. 
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4.2  Using  the  cost  J  =  J2 


The  cost  function  J2  can  be  expressed  as 
TV— 1 


J2  =  ^2  fk{rkPl,k  +  Sky/Pl,kP2,k  +  tkP2,k)- 


(4.10) 


k= 0 


Let.  p  =  [pi;o,  •  •  •  ,Pi,n-uP2,o,  •  •  •  ,P2,N-i]7  and  let  H  be  the  Hessian  matrix  of  J2 
with  respect  to  p.  Then,  one  can  verify  that  for  any  real  vector  v  £  M2Ar, 

TV-1 


V 


Hv  =  \  fk  '  (  ak 


k= 0 


rk  +  sk 


y/Wfi 

2y/Ppk. 


sjv^k  ' 
Sk  n  , -  T  tk 

2y/P%k 


J  2  y/P2,k  J  ,2  y/Phk 
—  Jk  '  aksk  ~F==  -  Jk  ■  bksk~ 


>  0, 


(4.11) 


where  we  have  applied  fk  <  0  and  fk  >  0.  Hence,  J2  is  convex  with  respect  to  p.  The 
KKT  conditions  of  (4.1)  with  J  =  J2  include 


fk  ■  d^Rk  +  H2 (glhlk)  +  Pi  =  0,  Vfc, 
dpi,k 

,  dsmik  _ 

fk  '  Pi  —  0,  VK, 

dp2,k 


(4.12) 

(4.13) 


where,  as  discussed  previously,  p\  >  0  and  P2  >  0.  The  following  proposition  gives  the 
solution  to  the  above  KKT  conditions  for  fixed  pi,  /12- 


Proposition  3  For  fixed  pi,p2,  the  solution  to  the  KKT  conditions  of  Problem  (4.2)  with 
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J  =  J2  is  given  by 


Pi,k  =  < 


0, 


t- 1/  “1' 

fc 

fk 


f'-1  [  "Ml 

I  .  ,  sfc_ 


tk^-rk  >  0 


tk^-rk<  0 


7  + 3 


’ffc 


P2,fc  =  < 


sk  >  0 


0, 

^rk+tk 

rikPi,k, 


,  tk^~rk>  0 


tk^-rk  <  0 


■  j  tk  fk  —  0 


Sk>  0 


where  7  =  g\h +  1,  and  r)k,\/k  G  j/e  :  S&  >  oj  is  given  in  (4-9). 


(4.14) 


(4.15) 


Please  refer  to  Appendix  8.3  for  the  proof. 

Summarizing  the  above  analysis,  we  have  the  following  algorithm  for  finding  the 
exact  solution  to  (4.1)  with  J  =  Jp- 
Algorithm  4: 


1.  Initialize  7  =  1  and  fi\  >  0. 

2.  Compute  r]k  from  (4.9)  for  all  k  where  sk  >  0.  Compute  pijk  and  p2)k  by  (4.14)  and 
(4.15)  for  all  k. 

3.  One  step  bisection:  if  the  left  side  of  (2.14)  is  smaller  than  its  right  side,  decrease  fi\. 
If  the  opposite  is  true,  increase  p\.  Go  to  step  2)  until  convergence. 


21 


4.  If  7  =  1  and  (2.15)  is  satisfied,  stop.  Otherwise,  go  to  next  step. 


5.  One  step  bisection:  If  the  left  side  of  (2.15)  is  larger  than  its  right  side,  increase  7.  If 
the  opposite  is  true,  decrease  7.  Go  to  step  2)  until  convergence. 

4.3  Discussion  of  the  Optimal  Source  Power  Ratio  — 

1  Pi,k 

Some  properties  of  the  optimal  source  power  ratio  ^7;  are  shown  below. 

•  Conditions  for  non-zero  source  transmit  power  in  the  relay  phase:  For  any 

given  gk  \/k,  the  optimal  is  given  in  (4.9),  which  is  larger  than  zero  if  Sk  >  0.  It 
follows  from  the  definition  of  Sk,  shown  below  (4.4),  that  Sk  >  0  if  and  only  if  g k  >  0, 
h1  j.  >  0,  h2g  >  0  and  h3  j,  >  0.  If  Sk  =  0,  it  follows  from  (4.4)  (i.e.,  maxpi  kjP2  k  SNF4 
subject  to  pig-  +  p2g  <  Pk  where  pk  is  any  amount  of  power  for  the  kth.  subcarrier) 
that  p\g  =  Pk  and  p-2g  =  0  if  rk  >  4,  or  otherwise  pig  =  0  and  p-2g  =  Pk  if  <  4- 
It  is  obvious  from  the  definitions  of  rk  and  4  that  rk  >  tk  if  k  =  k.  But  if  k  7^  k,  rk 
may  not  always  be  larger  than  4- 

•  Optimal  under  weak  direct  link  channel  gain  h\g:  If  the  direct  link  has 
an  infinite  attenuation  (or  weak  in  the  extreme),  i.e.,  h\g  ~  0  Vfc,  then  Sk  =  0, 
fk  >4  =  0  and  hence  ^7  =  0-  In  general,  as  the  direct  link  becomes  weaker,  jgg 
should  become  smaller. 

•  Optimal  jgg  under  strong  direct  link  channel  gain  h\g:  if  the  direct  link  (h\g 
Mk )  becomes  relatively  strong  compared  to  the  relay  links  (h2g  and  h^g  Vfc),  then  rk 
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and  tk  become  more  dominant  than  Sk-  In  this  case,  becomes  either  very  large  or 
very  small  depending  on  which  of  r>>;  and  tk  is  larger  than  the  other. 
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Chapter  5 


Overall  Algorithm  and  Optimality 
Analysis 


The  overall  algorithm  (named  Algorithm  AO)  for  solving  Problem  (2.13)  is  sum¬ 
marized  below. 

Algorithm  AO: 

1.  Use  algorithm  1  (or  algorithm  2)  with  a  pre-select  source  power  to  find  a  new  set  of 
relay  power,  then  go  to  Step  2); 

2.  Use  algorithm  3  (or  algorithm  4)  with  newly  updated  relay  power  to  find  a  new  set  of 
source  power,  then  go  to  Step  3); 

3.  Use  algorithm  1  (or  algorithm  2)  with  newly  updated  source  power  to  find  a  new  set 
of  relay  power,  then  go  to  Step  2)  until  converge,  |AJ|  <  e/. 

In  general,  the  alternating  optimization  method  may  fail  to  locate  the  stationary  points,  not 
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to  mention  the  global  convergence  to  the  optimal  solution.  However,  since  the  constraint 
functions  in  (2.14-2.15)  are  convex  either  for  fixed  {pi,k,P2,k}  or  for  fixed  {gk},  and  Algo¬ 
rithm  AO  belongs  to  the  nonlinear  Gauss-Seidel  (GS)  method  [16]  where  the  optimization 
vector  is  only  partitioned  into  two  component  vectors,  it  follows  from  [16]  that  Algorithm 
AO  converges  to  a  stationary  point  of  Problem  (2.13)  as  stated  in  the  following  theorem. 

Theorem  1  (Local  convergence  of  Alg.  AO)  Any  limit  point  {p\!k-,P2,ki9k  '■  VA;}  gen¬ 
erated  by  Algorithm  AO  is  a  stationary  point  of  Problem  (2.13). 

In  the  following,  we  derive  the  asymptotically  optimal  solutions  respectively  for  large  relay 

transmit  power  Pr  and  large  source-relay  channel  gain  I12  —  min  h-2  k ■  Based  on  this,  we 

k 

show  that  the  stationary  point  found  by  Algorithm  AO  is  asymptotically  optimal  for  large 
/? 2  or  Pr.  Throughout  this  section,  we  will  explicitly  express  the  cost  J  {{pi,k,P2,k9k})  as  a 
function  of  {p\.k:P2,k9k}  for  clearness. 

5.1  Asymptotically  Optimal  Solution  for  Large  Relay  Trans¬ 
mit  Power 

First,  we  give  an  upper  bound  for  the  SNR  and  show  that  the  SNR  upper  bound 
can  be  approached  as  Pr  grows  large. 

Lemma  1  For  fixed  p\^  +  P2,k  =  Pk,  the  SNR  on  the  k-th  subcarrier  is  upper  bounded  by 

SNR k  <  SNR f  =  Pkmax  +  h\ik,  h\  ^  . 


25 


Moreover,  if  we  let 


Pi,k  =  Pkl  (hl,k  +  hi,k  >  hl,k)  > 

P2,k  =  -Pfcl  (h,2 ,k  +  h\k  <  hfi^j  , 

0,  ifh\k  +  h\k  <  h\-k, 

otherwise, 


9k  =  < 


(5.1) 

(5.2) 

(5.3) 


prh  lk 


tV  iV(1+/l2.fe?,i.fc): 

then  the  corresponding  SNR/,,  satisfies  SNR jl1  —  SNR^  =  O  (jy;)  for  sufficiently  large  Pr, 
where  I(-)  denotes  the  indication  function  such  that  I  (E)  =  1  if  the  event  E  is  true  and 
I  (E)  =  0  otherwise. 


Please  refer  to  Appendix  8.4  for  the  proof. 

Using  Lemma  1,  we  can  obtain  an  asymptotically  optimal  solution  of  Problem 
(2.13)  for  large  Pr  by  solving  a  convex  optimization  problem  as  stated  in  the  following 
theorem. 


Theorem  2  (Asymptotically  optimal  solution  for  large  Pr)  Let{Pf  :  V/c}  denote  the 
optimal  solution  of  the  following  convex  optimization  problem 


N- 1 


min 

{Pk>  0:Vfc} 


J,  s.t.  ^2  pk  <  P.s, 

k=0 


(5.4) 


where  either  J  =  — mmSNR^  or  J  =  fk  (SNR^) .  Obtain  k,p ^  k,  gk  :  Vfc  j  using 

(5. 1-5. 3)  with  Pk  replaced  by  Pf.  Then  for  sufficiently  large  Pr,  |p°  kdP°2  k-9k  :  Vfe|  is 
an  asymptotically  optimal  solution  of  Problem  (2.13),  i.e.,  jit  kdP2  k9k  :  ^|  satisfies  the 
power  constraints  in  (2.14-2.15)  and 


J  {{Pi,k,P°2,k9°k})  ~  J* 
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(5.5) 


where  J*  is  the  optimal  value  of  Problem  (2.13). 


Please  refer  to  Appendix  8.5  for  the  proof. 

Based  on  Theorem  2,  we  establish  the  asymptotic  optimality  of  Algorithm  AO  for 

large  Pr. 

Theorem  3  (Asymptotic  optimality  of  Alg.  AO  for  large  Pr)  Suppose  that  in  Al¬ 
gorithm  AO,  the  initial  point  is  chosen  such  thatpi }k  =  O  (1),  P2,k  =  O  (75-)  if  ^2  +  k  > 
and  p\tk  =  O  P2,fc  =  O  (1)  otherwise.  Then  for  sufficiently  large  Pr,  any  limit 

point  {p\.ki"P2.kPjk  ■  VA}  generated  by  Algorithm  AO  is  an  asymptotically  optimal  solution 
of  Problem  (2.13),  i.e., 

J  ({Pi,k,P2,k,9k})  -  J*  =  O  ,  (5-6) 

and  {pi,kiP2,k->9k  '■  VA:}  satisfies  the  power  constraints  in  (2.14-2.15). 

Please  refer  to  Appendix  8.6  for  the  proof. 

5.2  Asymptotically  Optimal  Solution  for  Large  Source-Relay 
Channel  Gain 

We  first  give  an  upper  bound  for  the  SNR  and  show  that  the  SNR  upper  bound 
can  be  approached  as  A  2  grows  large. 

Lemma  2  For  given  non-negative  pi,k,P2,k,  9k,  the  SNR  on  the  k-th  subcarrier  is  upper 
bounded  by  SNRfc  (pi,k,P2,k,  9k)  <  SNRj^2  {pi,k,P2,k,  Pk),  where 

SNR k2  (pi,k,P2,k,Pk)  =  h\  kPi,k+  {h3,kVPk  +  hhky/P2fi)2  , 
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and  pk  =  9kh\kPi,k-  Moreover,  for  sufficiently  large  h 2  and  given  non-negative  p±  k,g'k,  if 
9k  =  °  (^fe)  -  Pi,k  ~  Vi, k  =  O  and  g2h\kpxk  =  pk,  then 

SNR^  {P\,kiP2,ki  Pk)  -  SNR k  (p'hk,P2,k,g'k)  =  0  ■ 

Lemma  2  follows  straightforward  and  the  proof  is  omitted  for  conciseness. 

Using  Lemma  2,  we  can  obtain  an  asymptotically  optimal  solution  of  Problem 
(2.13)  for  large  h 2  by  solving  a  convex  optimization  problem  as  stated  in  the  following 
theorem. 

Theorem  4  (Asymptotically  optimal  solution  for  large  h2)  Let  [p0hk,plk,p°k  :  VA:} 
be  an  optimal  solution  of  the  following  convex  optimization  problem 


min  J, 

{pi,fc>0,P2,fe>0,Pfe>0:Vfc} 


N—l 


N- 1 


S.t.  Y,Pk<  P r,  ^2  (Pbfc  +P2,k)  <  Ps, 


k= 0 


k= 0 
N—l 


where  either  J  =  —minSNRk2  or  J  =  Jfk=Q  fk  (SNR^2).  Define 


9k  =\ 


^>0’ 


^2  ,fc  y  pfk 

Mi 

VM 


otherwise. 


Pi,k  =  < 


Pi,fc.  */Pi,fc  >  °> 


*2  ’ 


otherwise. 


(5.7) 


Then  for  sufficiently  large  h2,  j P\  k,P2  k,  9k  •  an  asymptotically  optimal  solution  of 

Problem  (2.13),  i.e., 

J  {{Pi,k,P2,k,9°k})  -  J*  =0  ,  (5.8) 
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and  Ef=  o1  (Plk+P°2,k) 


-P,<0 


O  {^2,kPl,k  +  -*■)  (fffc)  -Pr  <  O  (  h2  )  • 


Please  refer  to  Appendix  8.7  for  the  proof. 

Obviously,  if  we  use  the  k,p £  k,  gk j  in  Theorem  4  as  initial  point,  the  solution 
{Pi,kiP2,k-,9k  '■  VA;}  found  by  Algorithm  AO  will  be  an  asymptotically  optimal  solution  of 
Problem  (2.13),  i.e., 

J  ({Pi,k,P2,k,9k})  ~  J*  =  0  .  (5.9) 


29 


Chapter  6 

Implementation  Issue  and 
Simulation  Result 

6.1  Implementation  Issue 

In  a  real  system,  the  AO  algorithm  could  be  executed  at  the  source  node.  The 
global  channel  state  information  (CSI)  must  be  supplied  to  this  node.  In  relay  systems,  the 
source-relay /source-destination  and  relay-destination  channels  can  be  estimated  directly  at 
the  destination  node  by  utilizing  pilot  signals  [17].  The  destination  must  then  feedback  the 
CSI  to  the  source  node. 

•  Complexity  Analysis:  Here,  each  of  addition,  subtraction,  multiplication  and  divi¬ 
sion  is  counted  as  one  FLOP.  The  operations  for  comparisons  are  neglected. 

—  Complexity  of  Algorithm  1:  Step  1  requires  6N  FLOPs  (floating  point 
operation),  while  Step  2  and  3  together  require  21 A  FLOPs,  where  N  is  the 
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number  of  subcarrier.  Then  the  complexity  order  of  the  overall  bisection  search 
process  is  O  ^21Alog2  +  6A^,  where  fi*  is  the  length  of  the  search  range 
in  Algorithm  1  and  e  is  the  tolerable  error. 

Complexity  of  Algorithm  2:  Step  1  requires  8 A,  while  step  2  requires 

(21+Mi)A,  where  the  M\  additional  FLOPs  per  subcarrier  is  introduced  by  con¬ 
vex  function  /.  A  different  function  /  may  require  a  different  number  of  FLOPs  in 
calculating  /',  //_1  and  f" .  Given  n  <  //,  the  complexity  order  of  the  overall  bi¬ 
section  search  process  is  upper-bounded  by  O  ^(21  +  Mi)  A log2  log2  (i~)  +  8A 

Complexity  of  Algorithm  3:  Steps  1  through  3  require  46 A  FLOPs.  The 
search  ranges  for  9  and  7  are  denoted  by  Rg  and  f?7 ,  respectively.  The  complexity 
order  is  upper-bounded  by  O  ^37 A  +  9Alog2  l°g2  (t1)  +  46A^.  The 

complexity  order  increases  logarithmically  with  Rg  and  f?7. 

Complexity  of  Algorithm  4:  Step  1  through  2  require  40 A  +  M2,  where 
the  additional  M2  FLOPs  are  introduced  by  the  convex  function  /.  Then,  the 
complexity  order  is  upper-bounded  by 

O  ^  ^26A  +  M2  A  +  14A  log2 

x  log2  +  40A  +  M2n\  (6.1) 

Overall  Complexity  of  Algorithm  AO:  With  the  above  analysis,  one  can 
obtain  the  total  number  of  FLOPS  per  iteration  of  the  AO  algorithm.  Basically, 
the  complexity  grows  linearly  with  the  number  of  carrier  and  logarithmically 
with  precision  requirement.  The  number  of  iterations  of  the  AO  algorithm  is 
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difficult  to  quantify,  which  depends  on  the  stop  criterion.  In  our  simulation,  the 
required  number  of  iterations  to  achieve  |  AJ|  <  e/  with  ej  =  0.005  is  about  2-4, 
where  A J  is  the  difference  of  J  between  two  consecutive  iterations. 


6.2  Simulation  Results 

We  have  tested  Algorithms  1-4  individually.  They  all  converged  rapidly  to  the 
exact  optimal  solutions. 

For  problem  (2.13)  with  J  =  Ji,  we  apply  Algorithms  1  and  3  alternately  until 
convergence.  For  problem  (2.13)  with  J  =  J2,  we  apply  Algorithms  2  and  4  alternately  until 
convergence.  We  determine  the  convergence  when  both  p\  and  p 2  satisfy  \p[  — /J[  ^  |  <  10-5 
and  1/^2  — 1-1^2  1'*  I  <  10~5  where  r  is  the  current  index  of  alternation.  For  all  tested  cases,  the 
convergence  was  reached  within  three  to  six  alternations.  The  initialization  of  the  source 
power  allocation  was  chosen  to  be  uniform,  i.e.,  p\.k  =  P2,k  =  5^- 

The  optimization  of  the  permutation  function  k  is  computationally  costly.  In  our 
simulation,  we  have  compared  three  choices  of  k:  1)  k  =  k,  2)  k  is  such  that  /i3  j.  is  aligned 
with  hitk  V/c,  and  3)  k  is  such  that  h:i  j,  is  aligned  with  fi2,k  VA:.  Two  sequences  Si  and  S2 
are  said  to  be  aligned  if  the  largest  element  in  S\  is  located  at  the  same  position  as  the 
largest  element  in  S2 ,  the  second  largest  element  in  S 1  is  located  at  the  same  position  as  the 
second  largest  element  in  S2,  and  so  on.  We  have  found  that  the  third  choice  of  k  always 
yields  slightly  better  result  than  the  first  two.  Also  note  that  with  the  third  choice  of  k, 
the  overall  relay  channel  gains  defined  by  h,2,kh3  j.  VA:  have  a  larger  dynamics  than  /i2,fc^3,fc, 
which  intuitively  makes  the  relay  power  scheduling  more  effective.  In  the  following,  we  use 
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the  third  choice  of  A. 


We  generated  the  channel  parameters  hi^,  h-z.k  and  VA  as  independent  circular 
complex  Gaussian  random  variables  of  zero  means.  We  choose  and  h 3^  VA  to  have  the 
variance  two,  and  h\  ^  VA  to  have  the  variance  2  x  IO10.  When  a  =  0,  the  direct  link  has 
the  same  strength  as  the  relay  links.  As  a  decreases  from  zero,  the  direct  link  weakens.  The 
value  of  a  measures  a  relative  (averaged)  strength  in  dB  of  the  direct  link  over  the  relay 
links. 

In  Fig.  6.1,  we  show  the  minimum  capacity  among  all  subchannels  versus  a. 
This  capacity  was  averaged  over  ten  independent  channel  realizations.  For  each  channel 
realization  (of  hgfc,  h-2,k  and  h,3.fc  VA),  Algorithms  1  and  3  were  used  to  maximize  min*,  SNRfc 
(or  minimize  —  min*,  SNR*;).  The  resulting  min/,,  SNR,/;  is  used  to  determine  the  minimum 
capacity  (in  bits/s/Hz)  by  \  log2 ( 1  +  minfcSNRfc).  We  see  a  significant  gap  of  capacity 
between  the  case  of  free  (optimal)  r//c  and  the  case  of  r]k  =  0  when  the  direct  link  is  not 
much  weaker  than  the  relay  links.  When  the  direct  link  becomes  much  weaker  than  the 
relay  links,  the  gap  diminishes  as  expected. 

The  distribution  of  the  optimal  log10  %  VA  determined  by  Algorithms  1  and  3  for 
a  typical  channel  realization  is  shown  in  Fig.  6.2.  As  predicted  by  the  analysis  shown  in 
Section  4.3,  when  the  direct  link  becomes  weaker  ( a  smaller),  approaches  to  zero.  And 
when  the  direct  link  becomes  stronger  (a  larger),  rjk  is  either  very  large  or  very  small.  For 
the  plot,  any  value  of  log10  rjk  larger  than  4  is  set  to  4,  and  any  value  of  log10  %  less  than 
-4  is  set  to  -4. 

Fig.  6.3  shows  the  averaged  subchannel  capacity  determined  by  Algorithms  2  and 
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Channel  strength  ratio  a  (dB) 


Figure  6.1:  Minimum  subchannel  capacity  determined  by  Algorithms  1  and  3  (for  J±)  with 
N  =  300  and  Ps  =  Pr  =  50.  The  scheme  with  free  is  based  on  Algorithms  1  and  3  with 
no  constraint  on  rjk-  The  scheme  with  i~jk  =  0  is  based  on  Algorithms  1  and  3  with  rjk  =  0. 


4  versus  a.  For  each  channel  realization,  Algorithms  2  and  4  were  used  to  minimize  J2 
shown  in  (2.12).  The  resulting  —J2  was  further  averaged  over  ten  independent  channel 
realizations  and  then  plotted  into  this  figure.  Once  again,  we  see  an  important  gap  between 
the  case  of  free  (optimal)  r]k  and  the  case  of  rjk  =  0  when  the  direct  link  is  not  very  weak. 

The  distribution  of  the  optimal  rjk  Vfc  determined  by  Algorithms  2  and  4  for  a 
typical  channel  realization  is  shown  in  Fig.  6.4.  The  general  trends  of  the  values  of  rjk  as  a 
increases  or  decreases  are  consistent  with  the  analysis  shown  in  Section  4.3. 

Comparing  the  two  costs  Ji  and  J2,  or  equivalently,  comparing  Figs.  6.1  and  6.2 
versus  Figs.  6.3  and  6.4,  we  see  that  allowing  rjk  to  be  nonzero  is  more  important  for  J\ 
than  for  J2,  i.e.,  the  capacity  gap  for  J\  is  larger  than  that  for  J2.  It  takes  a  much  weaker 
direct  link,  for  J\  than  for  J2,  that  rjk  approaches  to  zero.  We  also  see  that  the  distribution 
of  r)k  becomes  more  binary  for  ,J\  than  for  J2  as  the  direct  link  becomes  stronger. 
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Figure  6.2:  Distribution  of  the  optimal  %  \/k  determined  by  Algorithms  1  and  3  (for  J\) 
with  N  =  300  and  Ps  =  Pr  =  50. 


Figure  6.3:  Average  subchannel  capacity  determined  by  Algorithms  2  and  4  (for  J2)  with 
N  =  300  and  Ps  =  Pr  =  50.  The  scheme  with  free  i]k  is  that  by  Algorithms  2  and  4  with  no 
constraint  on  r//c ,  the  scheme  with  r]k  =  0  is  that  by  Algorithms  2  and  4  with  rjk  =  0,  and 
the  scheme  in  [1]  is  the  WF-MCAF  method  in  [1]  which  ignores  the  direct  link  completely. 
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Channel  strength  ratio  a  (dB) 


Figure  6.4:  Distribution  of  optimal  rjk  VA;  determined  by  Algorithms  2  and  4  (for  J2)  with 
N  =  300  and  Ps  =  Pr  =  50. 
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Chapter  7 


Conclusion 


We  have  studied  a  dual-phase  power  allocation  problem  for  a  multicarrier  relay 
system  with  direct  link.  We  have  developed  four  efficient  algorithms  that  yield  the  exact  so¬ 
lutions  to  the  four  corresponding  optimization  problems,  including  two  that  are  not  convex. 
Unlike  the  previous  works,  we  allow  the  source  power  to  be  distributed  in  both  phases  for 
each  carrier.  Our  analysis  and  simulation  show  that  this  additional  freedom  is  important 
when  the  direct  link  is  not  too  weak  compared  to  the  relay  links. 
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Chapter  8 

Proof  of  Lemma  and  Theorem 

8.1  Proof  of  Proposition  1 

1.  It  follows  from  (/SNfr  =  ^SNRfc  and  (3.3)  that  cZS,N?*  =  oo  when  gk  =  0,  and  < 

dg%  ^gkdgk  \  >  dg*  ’  dg* 

oo  when  gk  >  0.  Also  note  that  the  left-side  function  of  the  power  constraint  (3.2)  is  a 
(finitely)  weighted  sum  of  g\.  As  a  result,  if  there  is  a  gk  =  0,  we  can  always  increase 
gk  and  decrease  a  positive  g}j  without  affecting  the  power  constraint  but  decreasing 
the  cost  J2.  Therefore,  the  optimal  gk  for  the  cost  J2  must  be  positive  for  all  k,  and 
hence  7^  =  0,  V/c  G  K. o- 

2.  Since  the  optimal  gk  (also  denoted  by  gk)  to  Problem  (3.1)  must  fall  in  [0,  g^]  where 
J-2  is  decreasing  with  gk,  the  optimal  gk  Vfe  must  be  such  that  either  (3.2)  holds  with 
9k  =  9k  VA;  or  the  equality  in  (3.2)  holds  with  gk  <  g*k  for  at  least  one  k. 
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3.  It  follows  from  (3.3)  that 


dSNRfc 
2 9kdgk 


(h 3,kh2,k9k  +  hlkr)k ) 

- 2 - -  X 

9k 


(h,kh2,k  -  Vkhl  ^m) 
(hlk9k  +  !)2 


~Pl,k, 


(8.1) 


where  both  factors  in  the  right-hand  side  are  positive  and  decreasing  with  e  [0,^]. 
Since  fk(x)  is  a  decreasing  convex  function,  —f'k  is  also  positive  and  decreasing  with 
gk  €E  [0 ,gk].  For  a  given  feasible  g,  the  corresponding  g^  for  each  k  can  be  easily  found 
by  a  bisection  search  based  on  (3.7),  the  solution  of  which  is  denoted  by  9k(g)- 


8.2  Proof  of  Proposition  2 


The  constraint  (2.15)  may  or  may  not  be  active  (i.e.,  the  equality  may  or  may  not 
hold  at  the  optimal  solution).  If  the  solution  to  all  the  KKT  conditions,  except  with  g2  =  0, 
satisfies  (2.15)  with  equality,  then  /j<2  =  0  is  optimal.  Otherwise,  we  need  to  have  p2  >  0. 

Let  ai  =  gi  and  <22  =  /^gfJi'l  k  +  Mi-  Then,  for  each  pair  of  g\  and  g-2,  (4.5)  and 
(4.6)  become 


.  V  k  a  2 

r'I  +  s,IT_A;' 


1  ai 

t-k  +  Sk  - -  =  T  • 

2  9k  a k 


(8.2) 

(8.3) 


Taking  the  ratio  of  the  above  two  equations  leads  to  a  quadratic  equation  in  terms  of  rg , 


from  which  we  have  one  unique  (positive)  solution  in  (4.9). 

One  can  verify  from  (4.9)  that  gk  is  a  monotonically  increasing  function  of  ^ ,  i.e., 


>  0. 
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For  any  given  r/k  in  =  Vky/Pi^k,  we  have  from  (4.4)  that  SNRfc  =  (rk  +  rjksk  + 


Vk^k)pi,k  and  hence  for  each  given  9  we  can  choose 


pi,k  =  — ~ - ,  2+  (8-4) 

rk  +  VkSk  +  vftk 


to  achieve  SNR*.  =  9  \/k. 


The  search  for  9  should  be  such  that  the  equality  in  (2.14)  is  satisfied.  Obviously, 


the  left  side  of  (2.14)  is  also  a  monotonically  increasing  function  of  9  subject  to  given  rjk 


Vfc. 


If  the  inequality  in  (2.15)  is  satisfied  by  the  optimal  solution  to  (4.2),  ^  =  0  is 
optimal.  Otherwise,  we  must  have  —  >  0.  So,  we  should  start  the  search  for  —  from 
=  0.  The  left  side  of  (2.15)  is  a  decreasing  function  of  ^  subject  to  (4.9)  and  the 
equality  in  (2.14). 

The  expression  of  r/k  in  (4.9)  is  numerically  unstable  when  sk  is  small  (especially 
since  gk,  and  hence  sk,  for  some  k  can  be  exactly  zero).  To  solve  the  numerical  problem, 
we  need  to  consider  the  case  when  sk  is  arbitrarily  small.  As  sk  — >  0,  we  can  use  the  Taylor 
series  expansion  of  (4.9)  and  write  that 


9k  =  < 


sk ^ 

a± 

2  (tk^-rky 


OO, 


0, 


tk  — 

K  a  i 


tk~ 

K  a  i 


rk  >  0 
rk  <  0 


(8.5) 


tk^-rk  =  0 


and  using  this  in  (8.4)  leads  to  (4.7)  and  (4.8). 
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8.3  Proof  of  Proposition  3 


The  (4.12)  and  (4.13)  are  equivalent  to 


V  k  a  2 

rk  +  Sk  —  —  77 ? 

z  h 

1  aq 

tk  +  Sk~ —  =  —  77, 
2rlk  fk 


(8.6) 

(8.7) 


which  are  virtually  the  same  as  (8.2)  and  (8.3). 

Taking  the  ratio  of  (8.6)  and  (8.7)  and  solving  the  resulting  quadratic  equation  in 
terms  of  r/k ,  we  have  the  same  solution  as  shown  in  (4.9).  Hence,  for  any  given  ^ ,  we  can 
find  r)k  \/k  from  (4.9). 

Since  /(.  is  monotonically  increasing  with  SNRj, ,  it  is  also  nronotonically  increasing 
with  p\,k  subject  to  ^P2,k  =  Vky/Pifi-  Hence,  if  we  are  also  given  p\  =  a\  (in  addition  to 
7),  we  can  find  from  (8.7)  a  unique  pitk,  and  then  the  corresponding  P2,k,  V7c.  Specifically, 
we  recall  SNR*,  =  (rk  +  rjkSk  +  Vk^k)pi,k,  and  hence  have  from  (8.7)  that  fk  =  /(.( SNR*,)  = 


fk((rk  +  PkSk  +  ilktk)pi,k)  = 


—  "Mi 
t,+_£AL 
tk+2Vk 


.  Therefore, 


Pi,k  = 


ft- 1  I  "AO 

■Ik  l  ^  4.  sk 
fe+2»fe 


Tk  +  VkSk  +  vltk 


(8.8) 


where  f'k  l(x)  is  the  inverse  function  of  fk (x),  and  (x)+  =  max(x,0). 

It  is  clear  from  (8.8)  that  Pi,fc,P2,fc>  increase  as  deacrese,  subject  to  any  given 
7.  So,  we  can  use  the  bisection  search  to  determine  such  that  the  equality  in  (2.14) 
holds. 

The  equality  in  (2.15)  may  or  may  not  be  satisfied  by  the  optimal  solution  subject 
to  7  =  1  (or  equivalently  P2  =  0).  If  it  is  satisfied,  7  =  1  is  optimal  and  no  further  search 
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is  needed.  Otherwise,  we  need  to  increase  7.  Since 


increases  with  7,  the  left  side  of 

Pi.fc  n 


(2.15)  is  monotonically  decreasing  with  7  subject  to  the  equality  in  (2.14).  Hence,  we  can 
use  the  bisection  search  to  find  7  such  that  the  equality  in  (2.15)  is  satisfied. 

Similar  to  the  discussions  for  (4.7)  and  (4.8),  as  Sk  — >  0,  we  should  apply  (8.5)  to 
(8.8),  which  yields 


and 


Pi,k  =  < 


f' 

Jk 


P2,k  =  < 


0, 
_  02 


Ml 


\2 

-rk) 

rk 

r 

rk+^tk 

1 

(«■'(¥) 

r 

tk 

5 

0, 

(A-‘(V7) 

)+ 

^rk+tk 


tkal  rk  >  0 


tk^-rk<  0 


tk%~rk  =  0 


02 
z  a\ 


tk  77  -  rk  <  0 


J  tkal  Tk  —  ^ 


(8.8),  (8.9)  and  (8.10)  together  conclude  the  proof. 


(8.9) 


(8.10) 


8.4  Proof  of  Lemma  1 

For  fixed  Pitk,P2,k,  we  have  SNRfc  <  SNR))  =  p1>k  (h\  k  +  h\  ^j  +  P2,kh\  Then 
for  fixed  p\jk  +  P2,k  =  Pk,  the  SNR  upper  bounded  can  be  obtained  by  solving 

max  SNR*k,  s.t.  pi,k  +  p2,k  =  Pk.  (8.11) 

Pl,k,P2,k 

It  is  easy  to  see  that  the  optimal  solution  of  (8.11)  is  given  by  (5. 1-5.2)  and  the  optimal 
value  is  SNRj^11.  Substituting  (5. 1-5.3)  into  the  SNR  expression  in  (2.9),  it  can  be  verified 
that  SNR^1  -  SNRfc  =  O 
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8.5  Proof  of  Theorem  2 


By  Lemma  1,  we  have 

J(Pk)<J *•  (8-12) 

SNRUi  (po}  _  SNRfc  (p°fcjpofc^o)  =  0  QJ  .  (8.13) 

From  (8.13),  we  have 

J(Pfc°)  -  =  O  (-0  .  (8.14) 

Then  (5.5)  follows  immediately  from  (8.12)  and  (8.14).  Finally,  using  the  definition  of 
{^l  k'P2  k'  9k  ■  if  can  be  verified  that  jp)1  k,p2  k,  gk  :  Vfc  j  also  satisfies  the  power  con¬ 
straints  in  (2.14-2.15). 

8.6  Proof  of  Theorem  3 

Using  the  KKT  condition  in  (3.7),  it  can  be  shown  that  for  any  initial  point 
{Pi,k,P2,k}  that  satisfies  the  condition  in  Theorem  3,  the  optimal  gk  ^  obtained  in  step  1  of 
Algorithm  AO  satisfies  gk  ^  =  O  it  k  +  h\  k  <  Ii*  and  gk  1  =  O  (\/Ri)  otherwise. 

Following  similar  analysis  as  in  the  proof  of  Lemma  1 ,  it  can  be  shown  that 

j({j>;,*,P2,*,41>})  =o(L).  (8.15) 

By  Lemma  1,  we  have 

J{{P°k})<J*-  (8-16) 

For  fixed  g^\  the  optimal  jp[ obtained  in  step  2  of  Algorithm  AO  satisfies 

J  <  J  ({pl,kiP02,kT9k]})  •  (8-17) 
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From  (8.15-8.17),  we  have  J  ~  J*  =  O  (i^)’  he.,  after  the  first  iteration 

of  Algorithm  AO,  the  solution  is  already  asymptotically  optimal  for  large  Pr.  The  rest 
iterations  will  only  decrease  the  cost  function.  This  completes  the  proof. 

8.7  Proof  of  Theorem  4 

Using  Lemma  2  and  the  fact  that  the  constraint  J2k=o  Pk  —  pr  hr  (5-7)  is  a  more 
relaxed  relay  power  constraint  compared  to  the  original  one  in  (2.15),  we  have 

j{pik,P°2,k,P°k)<J*-  (8-18) 

Note  that  (gk)2  /<|  k'P°  k  =  P°k •  Then  it  follows  from  Lemma  2,  gk  =  O  ^  andp)1  k~p\  k  = 
O  that 

SNRf  -  SNRfc  {plk,p°2tk,g°k)  =  O  (^)  .  (8.19) 

From  (8.19),  we  have 

J  (Pl,kiP2,ki  Pk)  ~~  T  {{Pl,k’P2,ki9k})  =  ®  '  (8.20) 

Then  (5.8)  follows  immediately  from  (8.18)  and  (8.20).  Finally,  using  g°k  =  O  ^-^7=^  > 
Pi  k  ~  Pi  k  =  ®  (ui),  an<^  ^he  definition  of  k,P2  k->  P°k  :  Vfc|,  it  can  be  verified  that 
{Pi,fc’p2,fc^fc  :  Vfc}  satisfies  o  (p°i,k  +  P°2,k)  ~  Ps  <  O  (^), 
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Part  II 

A  Practical  Exploration  of 
Full-Duplex  Radio  Communication 


45 


Chapter  9 


Introduction 


Currently  the  radio  used  in  cellular  communication,  radio,  microwave  back-haul, 
WiFi,  bluetooth  technology  are  half-duplex.  Either  a  frequency-devision  (FD),  tinre-devision 
(TD),  or  other  division,  such  as  code-division  (CD),  is  used  for  multiple  access.  Nowadays 
the  spectrum  become  more  and  more  previous  as  the  demands  increases.  Many  researches 
have  been  done  to  improvement  the  management  of  spectrum.  Traditionally,  the  sub¬ 
channel  used  for  transmitting  is  different  from  the  sub-channel  used  for  receiving  due  to 
self-interference.  Full-duplex,  a  new  approach  with  theoretically  double  efficiency  of  spec¬ 
trum,  has  received  growing  interests  in  research  and  commercial  applications  [18,  19].  Full- 
duplex  will  bring  a  great  advantage  over  half-duplex  in  terms  of  capacity  and  interference 
coordination,  etc  and  has  the  potential  for  sustain  the  LTE  5G  [20] .  The  key  to  full-duplex 
is  to  remove  the  self-interference  caused  by  the  local  transmitter,  referred  as  self-interference 
cancellation  (SIC).  One  can  cancel  the  self-interference  using  the  following  techniques: 

•  Passive  Cancellation.  Passive  cancellation  includes  antenna  separation  and  fixed  an- 
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tenna  null.  Obviously  the  more  attenuation  between  the  receiver  and  the  local  trans¬ 


mitter,  better  the  cancellation.  However,  the  cancellation  is  at  the  expense  of  increas¬ 
ing  the  size  of  the  nodes,  which  is  limited  in  practice. 

•  Active  Digital  cancellation.  This  technique  are  DSP  based  cancellation  achieved  in 
digital  domain.  If  only  active  digital  cancellation  is  applied,  strong  self-interference 
will  still  be  presented  at  the  LNA  and  the  ADC.  The  saturation  of  LNA  and  the 
reduction  of  dynamic  range  of  ADC  limit  the  performance  of  full-duplex  link  capacity, 
or  even  destroy  the  connection. 

•  Active  analog  cancellation.  This  cancellation  is  aimed  to  cancel  the  self-interference 
before  the  digital  domain  to  prevent  the  saturation  or  dynamic  range  wastes.  This 
method  has  a  higher  implementation  cost  comparing  to  active  digital  cancellation  due 
to  the  difficulties  to  manipulating  signal  in  analog  domain. 

These  three  main  techniques  are  not  mutually  exclusive.  In  fact,  the  best  result  is  achieved 
using  the  combination  of  them.  Previously  several  research  are  conducted  to  verify  the 
performance  of  each  technique  and  the  combination  of  them.  Self-interference  cancellation 
achieved  by  antenna  placement  are  discussed  in  [21,  22,  23,  24,  25,  26].  In  [21,  22,  23],  the 
antenna  were  specifically  placed  to  create  a  pair  of  path  with  same  attenuation  and  opposite 
phase,  resulting  in  a  RF  null  at  receiver.  [24]  proposed  a  relay  antenna  domain  isolation 
technique,  [25]  was  based  on  antenna  mutual  coupling  model  and  [26]  used  a  directional 
antenna  to  achieve  the  null  at  the  receiver. 

In  [27],  noise  cancellation  chip  Intersil  QHX220  was  used  for  digital  active  cancellation. 
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However,  the  cancellation  was  limited  by  the  dynamic  range.  In  [28],  the  passive  cancel¬ 
lation  was  used  to  break  this  limits.  The  active  analog  cancellation  was  adopted  in  [21] 
with  two  additional  antenna.  To  reduce  the  additional  hardware  requirement,  in  [29],  a 
design  with  two  antenna  full-duplex  radio  was  proposed  using  a  Baiun  circuit.  Though 
this  method  didn’t  have  a  bandwidth  constrain,  the  performance  degraded  as  the  band¬ 
width  increases.  This  limits  the  application  of  the  proposed  structure  in  a  scenario  where 
the  self-interference  channel  is  frequency  selective.  Another  method  for  frequency  selec¬ 
tive  self-interference  channel  method  was  proposed  in  [30]  for  orthogonal  frequency  division 
multiplexing  (OFDM)  system.  This  method  processed  the  cancellation  in  digital  domain 
and  the  cancellation  happened  in  analog  domain.  In  [31],  we  proposed  a  more  general 
technique  for  any  system  with  this  advantage.  An  mathematically  analysis  of  20MHz  FD 
OFDM  wireless  system  was  presented  in  [32],  [33]  proposed  an  analytical  optimal  solution 
for  wideband  multitap  self-interference  channel  cancellation. 

For  active  analog  cancellation,  [34,  35,  36]  proposed  open  loop  techniques  on  narrow  or 
wide  band.  The  open-loop  made  the  system  vulnerable  to  the  transmitter  and  receiver 
impairment.  A  more  robust  approach  was  proposed  in  [37]  using  closed-loop.  Since  the 
self-interference  was  much  stronger  than  the  desired  signal,  an  iterative  cancellation  would 
improve  the  robustness  to  the  error  estimation  of  the  self-interference  channel.  A  two-  stage 
cancellation  was  proposed  in  [38].  While  the  above  method  were  using  multiple  antenna  (or 
directional  antenna)  for  active  analog  cancellation,  [39]  proposed  a  single  antenna  solution. 
[40]  also  proposed  a  single  antenna  solution  but  only  allowed  simultaneous  transmitting 
and  reception  on  different  adjacent  channel/subcarrier.  In  [41],  an  110  dB  cancellation 
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was  achieved  by  a  combination  of  digital  and  analog  cancellation  technique.  The  proposed 


method  had  a  significant  less  cancellation  in  strongly  frequency  selective  self-interference 
channel. 

MIMO  system  that  support  multiple  streams  were  considered  in  [42,  43,  44,  45,  46,  31].  The 
possibility  of  full-duplex  MIMO  was  discussed  in  [42],  A  20  MHz  full-duplex  MIMO  OFDM 
WiFi  radio  was  proposed  in  [43].  An  mixture  of  SISO  and  MIMO  scenario  was  discussed 
in  [44],  [45]  also  presented  a  full-duplex  WiFi  radio  with  complexity  scales  linearly  with 
the  number  of  antenna.  [46]  presented  a  solution  that  can  flexibly  select  the  number  of  RF 
chains  based  on  the  network  topology.  In  [31],  we  proposed  a  time  domain  transmit  beam¬ 
forming  method  without  the  assumption  of  the  frequency-flat  self-interference  channel.  This 
method  also  got  rid  of  the  prefix-region  problem  associated  with  the  OFDM  beamforming 
method. 

While  most  of  the  active  analog  method  weree  cancel  the  self-interference  at  the  RF  stage, 
[47,  48,  49,  50,  51]  discussed  the  analog  baseband  cancellation.  [47]  showed  a  significant  im¬ 
prove  with  an  additional  complementary  analog  baseband  cancellation  stage.  [48]  fed  back 
the  cancellation  signal  before  the  input  of  ADC.  [49]  used  an  adaptive  auxiliary  transmitter. 
[50,  51]  implemented  an  adaptive  filter  using  the  information  from  digital  domain.  In  [52], 
we  categorized  them  into  1)  all-analog  method,  such  as  [21,  40,  33,  41,  37],  2)  all-digital 
method,  such  as  [48]  and  3)  hybrid  method,  such  as  [28,  36,  31,  52],  In  an  all-analog  method, 
RF  interference  was  canceled  at  RF  front-end  by  using  its  RF  interference  source.  In  an  all- 
digital  method,  interference  was  canceled  only  after  the  RF  interference  had  been  converted 
into  baseband  and  digitized.  A  hybrid  method  might  use  either  a  baseband-controlled  trans- 
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mit  beamforming  method  to  cancel  interference  at  the  RF  front-end  of  receivers  [31]  or  use 
the  RF  signals  tapped  from  the  output  of  RF  power  amplifiers  to  cancel  the  interference 
after  down-conversion  to  a  lower  frequency  [52].  In  Figure  9.1,  four  approaches  for  SIC 
critical  for  full-duplex  radio  are  presents.  The  dash  lines  denote  the  cancellation  paths. 
The  all-analog  path  denotes  a  path  with  analog  input  interface,  analog  filter  and  analog 
output  interface.  The  all-digital  path  denotes  a  path  with  digital  input  interface,  digital 
filter  and  digital  output  interface.  The  hybrid-1  path  denotes  a  path  with  digital  input 
interface,  digital  filter  and  analog  output  interface.  The  hybrid-2  path  denotes  a  path  with 
analog  input  interface,  digital  filter  and  analog  output  interface.  The  two  antennas  can  be 
merged  into  one  by  using  a  circulator. 


Figure  9.1:  Four  approaches  for  self-interference  cancellation. 

Analog  active  cancellation  is  affected  by  the  impairment  of  the  transmitter  and  re¬ 
ceiver  such  as  phase  noise,  non-linear  distortion  and  thermal  noise.  This  had  been  an  bottle 


50 


neck  for  full-duplex  in  the  past.  [53,  54,  55,  56]  showed  that  the  effect  caused  by  the  phase 
noise  could  also  be  mitigated.  [53]  showed  that  the  main  bottle  neck  in  self-interference 
cancellation  was  the  local  oscillator  phase  noise.  [54]  considered  both  shared  oscillator  and 
independent  ones.  [55]  developed  a  digital  domain  cancellation  under  oscillator  phase  noise. 
[56]  investigated  the  amplify-and-forward  repeater  link  in  the  presents  of  the  oscillator  phase 
noise.  In  [52],  we  proposed  a  new  analog-digital  hybrid  method  without  the  limitation  of 
the  transmitting  noise.  A  blind  system  identification  and  equalization  algorithm  for  finding 
the  optimal  configuration  was  discussed  and  provided.  [57,  58,  59,  60,  61]  considered  the 
effect  caused  by  nonlinear  distortion.  [60]  provided  a  comprehensive  analysis  of  the  effects 
of  nonlinear  distortion  in  transmitter  power  amplifier.  [61]  analyzed  the  performance  of 
full-duplex  MIMO  OFDM  transceivers  under  non-ideal  ADCs.  In  [62],  impact  of  thermal 
noise  on  full-duplex  radio  was  analyzed.  Tabel  9.1  shows  the  topic  covered  by  each  paper 
We  have  proposed  novel  architecture  for  all-analog  cancellation  using  a  clustered 
taps  in  [63]  and  compared  its  performance  with  the  uniformly  distributed  RF  attnuator  in 
[41,  37].  This  method  method  would  taken  care  of  all  the  impairment  of  the  transmitter, 
which  is  the  main  source  of  noise.  The  following  section  shows  the  detail  of  our  contribu- 


Category 

Reference 

Passive  Cancellation 

[21,  22,  23,  24,  25,  26,  27,  28,  34,  43] 

Active  Digital  Cancellation 

[20,  21,  27,  28,  29,  30,  31,  32,  33,  34,  35,  36,  37,  38] 
[39,  41,  43,  45,  46,  47,  52,  61] 

Actice  Analog  Cancellation 

[21,  28,  29,  34,  35,  38,  41,  43,  45,  46,  48,  49,  50,  51] 
[55,  58,  59,  61] 

Table  9.1:  Topics  covered  by  the  reference  paper. 
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tion.  First,  the  transmitting  beamforming  method  is  discussed  and  the  experiment  data  is 


presented,  second  a  new  adaptive  method  is  presented  and  the  algorithm  associate  with  it 
is  also  demonstrated. 
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Chapter  10 


Transmitting  Beamfoming  Method 


Since  the  transmitted  signal  are  known  and  the  interference  channel  can  be  es¬ 
timated,  we  can  subtract  the  self-interference  from  the  received  waveform.  It  is  the  most 
straightforward  method  to  remove  the  self-interference.  The  model  can  be  expressed  as: 

y[n]  =  r[n]  —  i[n]  =  x[n]  *  hi[n ]  +  d[n\  *  hd[n]  —  x[n]  *  hi[n ]  (10-1) 

where  r[n]  is  the  received  signal,  i [n]  is  the  estimated  interference  signal,  x[n\  is  the  trans¬ 
mitted  signal,  d[n]  is  the  desired  remote  signal,  hi[n\  is  the  self-interference  channel,  hd[n\ 
is  the  remote  channel  and  the  ht  \ri\  is  the  estimated  self-interference  channel  in  baseband 
complex  form.  With  an  accurate  enough  estimation  of  the  self-interference  channel,  we 
can  achieve  full-duplex  radio  communication.  However,  the  cancellation  happens  in  digital 
domain,  which  means  that  the  strong  self-interference  will  be  presented  at  the  front  end 
of  the  LNA  and  ADC.  This  will  caused  the  saturation  of  LNA.  Even  the  the  LNA  is  not 
saturated,  the  dynamic  range  of  ADC  will  be  great  reduced.  For  example,  assume  we  have 
two  communication  nodes  with  channel  strength  shown  in  Figure  10.1.  The  path  loss  (with 
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everything  lumped  together)  between  the  transmitter  and  the  remote  receiver  is  90  dB,  the 
path  loss  between  the  transmitter  and  the  local  receiver  is  40dB,  both  receivers,  remote  or 
the  local  ones,  require  20dB  to  decode,  and  has  a  noise  floor  at  -lOOdBm.  Therefore  to 
ensure  the  SNR  of  both  the  receiver,  the  receiving  signal  should  at  least  be  -80dBm,  with 
the  transmitting  power  at  lOdBm  and  the  self-interference  power  at  -30dBm.  In  this  case, 
the  self-interference  is  50dB  larger  than  the  desired  signal.  This  will  saturate  the  amplifier 
and/or  waste  the  dynamic  range  of  ADC  as  we  will  discussed  in  detail  later.  An  example  of 
power  level  at  the  receiver  is  shown  in  Figure  10.2.  As  we  can  see  the  desired  remote  signal 
is  deeply  buried  by  the  self-interference. 


Figure  10.1:  Remote  Channel  Gain  vs.  Self-Interference  Channel  Gain 
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Figure  10.2:  Power  levels  at  receiver. 


To  achieve  fullduplex,  interference  has  to  be  canceled.  Initially,  digital  cancellation 
is  applied  and  cancellation  is  limited.  This  is  due  to  the  saturation  power  of  the  front-end 
LNA  amplifier  is  limited.  Then  RF/analog  domain  cancellation  is  studied.  We  have  to 
cancel  the  interference  before  the  input  of  the  LNA  amplifier,  which  can  not  be  achieved 
by  digital  cancellation.  A  variety  structures  have  been  designed  for  the  RF/analog  domain 
cancellation,  so  are  the  corresponding  control  algorithm  for  the  structures.  As  shown  in 
Figure  10.2,  the  self-interference  is  50dB  stronger  than  the  desired  remote  signal.  Assume 
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we  are  using  a  14-bit  ADC,  the  whole  dynamic  range  is 


,  fMaxValue\  ,  214  —  1 


2° 


MdB 


(10.2) 


By  subtracting  the  self-interference,  we  removed  the  larger  50  dB  range  from  the  dynamic 
range,  therefore  leaving  34dB  for  decoding  desired  remote  signal,  which  is  equivalent  to 
N  =  5.6  bits  ADC  without  self-interference,  as  shown  in  Equ.  (10.3) 


N  =  log2  10 


34x0.05 


5.6 


(10.3) 


With  much  less  digits  available  in  the  ADC,  the  quantization  error  become  significant 
and  the  whole  communication  is  compromised.  In  reality,  sometime  self-interference  could 
be  more  than  100  dB  higher  than  the  desired  signal.  It  is  not  practical  to  use  a  digital 
interference  removal  to  achieve  full-duplex  radio  alone. 


10.1  Structure  and  Procedure  in  SISO  Mode 

As  stated  earlier,  digital  interference  cancellation  is  not  practical  due  to  the  satu¬ 
ration  of  LNA  and  reduction  of  the  dynamic  range  of  ADC.  In  [31],  we  presented  a  method 
for  full  duplex  Radio.  This  method  cancels  the  self-interference  at  antenna  to  avoid  satu¬ 
ration  effect.  The  fullduplex  needs  an  additional  transmitting  antenna  to  cancel  the  self¬ 
interference.  This  antenna  is  called  cancelmitter  (Cx).  The  full-duplex  node  has  one  trans¬ 
mitter  (Tx),  one  cancelmitter  (Cx)  and  one  receiver  (Rx).  The  hardware  experimental 
schematic  is  shown  in  Figure  10.3,  where,  two  Signal  Generator,  Agilent  MXG  N5182A 
(SGI  and  SG2),  and  one  Signal  Analyzer,  Agilent  MXA  N9020A  (SA)  are  deployed  as  Cx, 
Tx  and  Rx,  respectively. 
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A1 


Figure  10.3:  Schematic  of  experiment  using  Agilent  MXA  N9020A  and  MXG  N5182A  for 
Full  Duplex  Hardware  Experiment.  Two  MXG  and  one  MXA  are  used  as  Tx,  Cx  and  Rx. 


During  channel  estimation  phase,  broadband  signal  is  transmitted  from  Tx  and 
Rx.  The  received  signal  is  processed  for  channel  estimation.  During  full-duplex  phase, 
according  to  the  estimated  channel,  a  transmitting  beamforming  method  is  applied  to  make 
an  null  at  the  Rx.  Another  important  issue  is  the  alignment.  The  interference  signal  and 
the  cancel  signal  should  arrive  the  receiver  at  the  same  time.  This  requires  additional  effort 
to  record  the  delay  of  each  path.  The  accuracy  of  the  alignment  is  critical  and  sometimes  is 
smaller  than  the  sample  interval.  Though  we  can  align  the  signal  using  fractional  sampling 
method,  a  novel  procedure  without  alignment  is  proposed  in  [31]  as: 

1  Tx  transmits  a  delta  signal  p[n]  and  Cx  transmits  p[n],  Rx  captures  it  as  p\[n]\ 

2  Tx  transmits  p[n]  and  Cx  transmits  —p[n\  and  Rx  captures  them  using  the  same  delay 
as  p2  [n\ ; 

3  To  transmit  s[n]  in  full-duplex  mode,  Tx  transmits  s[n]  *  ( P2[n }  —  p\[n\)  and  Cx 
transmits  s  [n]  *  ( p\  [n]  +  p2  [n] ) 
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By  using  this  procedure,  interference  and  the  cancellation  goes  through  self-interference 


channel  and  cancellation  channel  in  different  orders.  After  permutation,  they  both  go 
through  the  same  channel  therefore  all  the  delay  are  identical.  We  can  show  that 

Pi[n]  =  p[n]  *  h\[n]  +  p[n]  *  /12M  (10-4) 

P2[n\  =  p[n]  *  hi[n\  —  p[n\  *  h,2[n]  (10.5) 

where  h  1  [n]  is  the  baseband  complex  channel  between  Tx  and  Rx,  li2  [n]  is  the  baseband 
complex  channel  between  Cx  and  Rx.  The  interference  at  the  Rx  is  expressed  as: 

I[n\  =  -s[n]  *  (p2[n\  -  pi[n\)  *  h\[n ]  +  s[n]  *  (pi[n\  +  P2N)  *  /12H 

=  2s[n]  *  (—  p[n]  *  /i2[n]  *  h\[n\  —  p[n]  *  h\[n ]  *  h2[n])  =  0  (10.6) 

The  transmitted  full-duplex  radio  signal  goes  through  an  additional  pre- filter  P2  [n]  +  p\  [n] 
or  P2[n]  —pi[n].  This  length  of  the  filter  is  small  since  it’s  a  response  of  the  self-interference 
channel,  and  can  be  lumped  into  the  real  channel.  Therefore  this  additional  filter  won’t 
affect  the  reception  and  decoding  for  the  remote  receiving  node.  The  reception  of  the  remote 
signal  is  not  affected. 

10.2  Pre-Filter  Design  in  MIMO  Mode 

The  procedure  for  general  MIMO  full-duplex  radio  is  also  discussed  in  [31].  As¬ 
sume  the  full-duplex  node  has  Nf  transmitter,  the  first  Nr  are  auxiliary  Tx  used  for  self¬ 
interference  cancellation,  last  Ns  =  Nt  —  Nr  are  primary  Tx,  and  JV/  receiver.  The  received 
self-interference  can  be  represented  as 

i[n]  =  H[n]  *  x[n]  =  H[n]  *  G[n]s[n]  (10.7) 
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where,  i[ra]  is  A)  x  1  column  vector  composed  by  the  received  self-interference  at  each  Rx, 
H  [n]  is  Ni  x  Nt  matrix,  whose  ( i ,  j)  element  is  the  self-interference  channel  response  between, 
the  jth  Tx  and  zth  Rx,  x[n]  is  Nt  x  1  vector  contains  the  transmitted  signal  from  each  Tx, 
G[n\  is  the  Nt  x  Ns  transmitting  beamforming  matrix  and  s  [n]  is  the  transmitted  signal  in 
half-duplex  radio.  We  have  to  develop  G[n]  with  the  knowledge  of  H[n]  so  that 


H[n]  *  G[n]  =  0 

For  the  kth  Tx  in  half-duplex  radio,  we  should  have 

i fcN  =  H[n]  *  g k[n\  =  0 


(10.8) 


(10.9) 


then  we  can  have 


Ns 

iN  =  ^ifc[n]  =  0 

k= 1 


(10.10) 


where  bold  gk[n ]  is  the  kth  column  of  G[n],  to  cancel  the  self-interference  cancellation,  we 
should  satisfy  Equ.  (10.9)  for  all  k.  Now  we  can  discuss  the  condition  for  the  existence 
of  the  solution  to  the  aforementioned  equation.  If  J2i=iSLiln\  *  f)  [n]  =  0  implies  ai[n]  = 
a2[n]  =  •  •  ■  =  a/[n]  =0,  f)  are  convolutively  independent.  The  convolution  rank  rcon„(H[n]) 
of  matrix  H[n]  is  the  largest  number  of  columns  (rows)  in  H[n]  that  are  convolutively 
independent.  It’s  more  or  less  like  the  linear  independent  concept.  Similarly  we  have 
the  ?’con„(H[n])  <  min{N  A^}  =  Nr.  The  dimension  of  the  solution  space  of  Equ.  (10.9), 
similarly  called  the  right  null  space  of  H[n],  is  the  number  of  the  convolutively  independent 
solution  to  Equ.  (10.9),  dnuu  =  Nt  —  rcom,(H[n])  >  Ns.  In  practice,  typically  we  will  have 
dnuii  =  A)s,  given  the  rich  scatter  environment.  Therefore  when  the  data  stream  is  less  than 
the  number  of  primary  Tx,  we  can  find  solution  to  Equ.  (10.9).  The  choice  of  (jk  [n]  is  not 
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unique.  One  of  the  solution  we  provide  in  [31]  is  described  as: 

Sk[n] 

Ofc-i,! 

g  k[n\  =  (10.11) 

go,k[n] 

®Nr- 1,1 

where  0m>i  is  the  m  x  1  zero  vector,  gfc[n]  and  go,fcM  are  the  solution  to 

A[n]  *  g k[n\  +  b k[n\  *  go,k[n]  =  0  (10.12) 

where  A[n]  is  a  square  matrix  equal  to  the  first  Nr  columns  of  H[n]  and  bfc[n]  is  the  A^+^th 
column  of  H[n].  Therefore  one  straightforward  solution  is 

gfcN  =  ~adj{A[n}}  *  b k[n\  (10.13) 

and 

9o,k[n\  =  det,{A[n}}  (10.14) 

All  the  values  can  be  obtained  analytically  and  no  division  is  involved. 

10.3  Experimental  Results  and  Conclusion 

The  transmitting  beamforming  method  is  test  using  1)  Agilent  MXA  and  MXG 
with  time-invariant  channel  and  2)  WARP  board  in  real  time-variant  channel.  The  details 
are  recored  below. 
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Figure  10.4:  Agilent  MXA  N9020A  and  MXG  N5182A.  Two  MXG  and  one  MXA  are  used 
as  Tx,  Cx  and  Rx.  A  10  MHz  local  oscillator  is  shared.  We  used  40  MHz  broadband  signal 
at  2.4  GHz  on  this  platform.  Details  of  the  component  are  listed  in  Appendix  A.  The 
schematic  is  shown  in  Figure  10.3. 


Figure  10.4  shows  the  Agilent  MXA  and  MXG  used  in  this  experiment.  Power 
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combiner  ZAPD-4-S+  from  mini-circuit  is  used.  The  time  and  frequency  domain  full- 


duplex  radio  self-interference  caused  by  the  transmitted  signal  are  shown  in  Figure  10.5 
and  Figure  10.6,  respectively,  time  and  frequency  domain  residual  are  shown  in  Figure  10.7 
and  Figure  10.8. 


Agilent  Spectrum  Analyzer  -  Swept  SA 


Figure  10.5:  Self-Interference  in  Frequency  Domain  in  Agilent  MXA  and  MXG  Experiment 
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Agilent  IQ  Analyzer  (Basic)  -  IQ  Waveform 


Figure  10.6:  Self-Interference  in  Time  Domain  in  Agilent  MXA  and  MXG  Experiment 
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Agilent  Spectrum  Analyzer  -  Swept  SA 


Figure  10.7:  Residual  in  Frequency  Domain  in  Agilent  MXA  and  MXG  Experiment 
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Agilent  IQ  Analyzer  (Basic)  -  IQ  Waveform 


Figure  10.8:  Residual  in  Time  Domain  in  Agilent  MXA  and  MXG  Experiment 


As  we  can  see  that  the  signal  at  the  central  frequency  is  canceled  by  50dB.  However, 
we  cannot  do  a  real  time  experiment  in  this  system  due  to  the  limitation  of  the  interface 
provided  by  the  equipment.  Also,  the  RF  chain  of  Agilent  MXA  and  MXG  is  much  superior 
than  the  off-the-shelf  commercial.  For  the  real  time  experiment,  we  use  WARP  radio 
hardware  plat  form  Mango  communication  [64]  as  shown  in  Figure  10.9. 
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Figure  10.9:  WARP  board  for  Full  Duplex  Node  Hardware  Experiment.  On  this  platform 
we  test  the  self- interference  cancellation  for  20MHz  signal  at  2.4GHz.  Three  radio  board 
is  configured  as  Tx,  Cx  an  Rx.  The  local  oscillator  and  clock  is  shared.  Details  of  the 
component  and  the  related  hardware  are  listed  in  Appendix  A. 


The  key  components  of  the  platform  is  listed  below: 

1.  Xilinx  Virtex-  FX100  FPGA  (XC4VFX100-11FFG1517C) 

2.  20MHz  and  40  MHz  temperature  compensated  oscillators  for  RF  carrier  reference  and 
logic  and  converter  clocks  respectively. 

3.  Dual  AD9510  low  jitter  buffers  for  clocks. 

4.  Dual  65MS/sec  14-bit  analog-digital-converter  (ADC)  for  Rx  I/Q  (AD9248) 

5.  Dual  125  MS/sec  16-bit  digital-analog-converter  (DAC)  for  Tx  I/Q  (AD9777) 
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6.  20MS/sec  10-bit  ADC  for  receiver  signal  strength  indicator  (RSSI)  (AD9200) 

7.  2.4  &  5GHz  RF  transceiver  (MAX2829) 

8.  18dBm  power  amplifier. 

9.  Power  combiner  ZAPD-4-S+ 

10.  2.4GHz  antenna 

More  details  of  the  involved  component  are  listed  in  Appendix  A.  Comparing  with  the 
Aigilent  MXA  and  MXG  experiment,  the  RF  and  analog  component  are  less  expensive 
and  accurate.  This  experiment  is  more  practical  given  that  most  of  the  component  are 
commercial  off-the-shelf  ones.  Firmware  and  software  were  developed  based  on  the  reference 
design  of  WARP  project.  The  concept  schematic  is  shown  in  Figure  10.10  with  most  of  the 
interface  neglected  for  convenience.  Similar  to  the  Angilent  MXA  and  MXG  experiment, 
three  radio  chain  is  shown  as  transmitter,  cancelmitter  and  receiver. 


Programmable  &  Configurable 


Figure  10.10:  The  structure  of  the  experiment  system. 
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An  example  of  the  time  domain  waveform  in  a  single  experiment  is  shown  in  Fig¬ 


ure  10.11.  We  can  see  from  the  upper-left  plot  that  without  transmitting  beamforming, 
self-interference  signal  saturated  the  receiver  (notice  the  clipping  at  the  peak  of  the  wave¬ 
form.)  While  in  the  lower-left  plot  the  residual  interference  is  within  the  dynamic  range, 
which  can  be  further  removed  by  applying  baseband  digital  method. 


x  -|  (^Interference  waveform  x  -^Cancellation  waveform 


Figure  10.11:  The  upper  left  plot  shows  the  received  Rx  waveform  when  only  Tx  is  transmit- 
ting(interference  waveform),  the  upper  right  plot  shows  received  Rx  waveform  when  only  Cx 
is  transmitting  (cancellation  waveform),  the  lower  left  plot  shows  the  received  Rx  waveform 
when  Tx  and  Cx  are  both  transmitting,  and  the  lower  right  plot  shows  the  transmitted 
ideal  waveform.  All  the  plots  are  captured  within  25.6ns. 


In  this  experiment  we  use  four  antenna  position  with  repeated  test  to  get  the 
statistics  of  the  cancellation.  As  shown  in  Figure  10.12,  Tx  is  placed  in  the  same  location 
(Marked  with  Tx)  while  Rx  has  four  locations  (Marked  with  Rxl,  Rx2,  Rx3  and  Rx4).  The 
corresponding  results  are  the  curve  labeled  with  Cl,  C2,  C3  and  C4  in  Figure  10.13. 
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Figure  10.12:  Photo  of  the  equipment  and  position  of  Tx  and  Rx  antenna.  To  verify  the 
cancellation  performance  with  multiple  different  surrounding  structure.  The  Tx  is  placed  in 
a  fixed  location  and  Rx  has  four  location  as  shown.  Cx  is  connected  via  circulator  therefore 
no  Cx  antenna.  Identical  power,  bandwidth,  carrier  frequency  are  used  for  all  the  test. 
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INR  in  dB 


Figure  10.13:  In  the  legend,  ‘Before’  stands  for  self-interference,  ‘RF’  stands  for  the  proposed 
cancellation  method,  ‘BB’  stands  for  the  baseband  cancellation  and  ‘C n’  stands  for  the 
configuration  of  antenna  position  with  index  n  in  Figure  10.12. 


As  we  can  see  that  the  cancellation  amount  is  less  than  the  Agilent  MXA  and 
MXG  experiment.  Roughly  27dB  is  canceled  before  digitizing.  This  leaves  the  residual  in 
the  dynamic  range  that  can  be  further  removed  by  digital  signal  processing  method.  This 
method  prevent  the  LNA  from  saturation  and  reduce  the  dynamic  range  loss  by  27dB. 
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10.4  Limitation  of  transmitting  Beamforming:  transmitting 


Noise 

The  dominating  random  noise  floor  is  not  introduced  at  the  receiver  side.  Instead 
the  noise  floor  raised  with  the  transmitting  power.  As  shown  in  Figure  10.2,  the  noise  floor 
of  self-interference  is  about  9  dB  higher  than  the  receiver  noise  floor.  Traditionally,  the  noise 
floor  from  remote  transmitter  is  buried  by  the  receiver  noise  floor.  However,  in  full-duplex 
radio,  the  transmitter  noise  is  dominant.  Since  we  cannot  cancel  the  random  transmitting 
noise,  this  effect  puts  an  upper  limits  of  the  cancellation  we  can  achieve  using  transmitter 
beamforming  method.  In  other  words,  the  actual  transmitted  signal  p[n]is  not  exactly  p[n\. 
It  can  be  modeled  as: 

p[n\  =  p[n]  +  r[n ]  (10.15) 

And  the  transmitting  SNR  is  defined  as 

SNRt  =  20  log  (10.16) 

IrMII 

where  r[n]  is  a  random  noise,  whose  power  increases  with  the  transmitting  power.  We 
verified  the  transmitting  noise  with  MXA  and  MXG,  as  shown  in  Figure  10.4,  is  60dB, 
while  the  transmitting  of  WARP  board  is  only  about  30dB.  Also  the  transmitting  noise 
will  brought  extra  error  in  self-interference  channel  estimation,  but  the  main  effect  is  that 
the  transmitting  noise  cannot  be  canceled  during  full-duplex  communication.  This  barrier 
limits  the  transmitting  power  for  a  full-duplex  node.  Without  a  method  to  overcome  this 
barrier,  the  full-duplex  radio  will  not  be  practical. 
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Chapter  11 


Adaptive  Filter  Method 


Beamforming  method  can  cancel  the  self-interference  down  to  the  transmitting 
noise  floor.  For  long  distance  communication,  the  attenuation  between  the  Tx  and  remote 
Rx  is  larger.  With  larger  power  gap  between  the  transmitting  and  the  receiving  power, 
beamforming  method  become  less  practical  for  long  distance  full-duplex  radio.  For  prac¬ 
tical  long  distance  full-duplex  radio,  we  have  to  cancel  the  self-interference  beyond  the 
transmitting  SNR.  One  option  is  to  use  transmitter  with  less  transmitting  noise,  just  like 
the  Agilent  MXG.  This  solution  may  become  feasible  in  the  future.  Since  we  cannot  repro¬ 
duce  the  transmitting  noise,  the  only  way  to  cancel  it  is  to  use  filtered  version  of  that  noise 
to  cancel  itself.  Another  method  we  proposed  later  is  to  use  an  adaptive  filter  to  create  an 
”  opposite”  self- interference  channel  to  cancel  the  self-interference. 
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11.1  Adaptive  Filter  Architecture 


The  essence  is  to  construct  a  ’’opposite”  channel,  which  cancels  the  actual  self¬ 
interference  channel.  In  [52],  we  provide  a  detailed  model  with  detailed  block  for  the 
adaptive  filter  method.  The  basic  structure  is  shown  in  Figuire.  11.1. 


N°'se  Desired  signal 

w(n)  s(n) 


Output 

y(n) 


Feedback  control  loop  for  adapting  G 


Figure  11.1:  Adaptive  filter  for  self-interference  cancellation 

The  input  signal  x[n ]  is  the  digital  source  interference  signal  before  DAC  in  the 
transmit  chain.  w[n]  is  the  transmitting  noise  from  the  entire  transmit  chain.  s[n]  is  the 
desired  remote  signal.  y[n]  is  the  only  observed  signal  after  ADC.  y[n ]  has  two  component, 
one  is  the  self-interference,  the  other  is  the  desired  remote  signal.  H \  represents  the  transfer 
function  of  all  the  path,  and  none  of  them  is  known.  The  detailed  description  of  each  H 
are  as  follow: 

1.  H\  represents  the  equivalent  baseband  channel  between  the  digital  domain  and  the 
output  of  the  RF  power  amplifier.  All  the  noise  is  added  after  Hi  for  convenience. 
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2.  H2  represents  the  channel  between  the  transmit  antenna  and  the  receive  antenna.  If 
other  component,  such  as  circulator  is  used,  H2  would  represent  the  combination  of 
isolation  path  of  the  RF  circulator  and  the  local  reflect  channel. 

3.  H3  represents  the  channel  between  the  receive  antenna  and  an  analog  baseband  signal 
combiner  before  LNA. 

4.  represents  the  channel  between  the  signal  combiner  and  the  output  y  [n] ,  including 
LNA,  VGA  and  ADC. 

5.  Hr,  represents  the  channel  between  the  output  of  the  transmit  power  amplifier  in  the 
transmit  chain  and  the  input  of  the  digital-controlled  filter. 

6.  Hq  represents  the  channel  between  the  output  of  the  digital  controlled  filter  G  and 
the  signal  combiner. 

If  the  transfer  function  G  satisfied  that 

H6GH5  =  -H3H2  (11.1) 

then  both  x[n]  and  w[n]  will  not  affect  y[n\.  We  can  cancel  the  transmitting  noise  by  using 
the  adaptive  filter.  To  avoid  introducing  additional  noise,  it  is  preferred  that  only  passive 
component  are  used  in  the  adaptive  filter.  Below  we  first  show  that  the  attenuator-delay 
combination  is  adequate  for  construct  the  adaptive  filter  G.  Our  discussion  is  based  on  a 
most  accepted  structure  of  adaptive  filter  G  as  shown  in  Figure  11.2,  named  as  all- analog 
canceler. 
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Interference  Channel 


Figure  11.2:  Architecture  of  adaptive  G  filter  with  taps 

11.1.1  Single  Tone  Signal  Cancellation 

If  there  is  only  one  radio  path  between  the  transmitter  and  receiver  and  single 
tone  is  used  for  wireless  transmitting.  The  received  signal  can  be  express  as 

y(t )  =  x(t  -  r)Aexp(j(9)  (11.2) 

where  y(t )  is  the  received  signal,  x(t)  is  transmitted  signal,  A  is  the  amplitude  attenuation 
of  the  path,  and  9  is  the  phase  caused  by  the  path.  We  can  construct  a  cancel  path  using 
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one  attenuator  to  create  a  path  with  the  received  signal  being: 


y(t)  =  x(t  —  r)Aexp(j9)  =  —x(t  —  r)Aexp(j6)  (11.3) 

If  multiple  path  are  presented,  we  can  use  multiple  cancellation  path.  The  ideal  sum  of 

interference  and  cancellation  can  be  expressed  as: 

N  N 

y(t )  +  y(t )  =  ^2  An  exp (jOn)x(t  -  Tn)  +  ^2  An  exp (j9n)x(t  -  rn)  =  0  (11-4) 

n= 1  n— 1 

The  method  for  finding  the  attenuation  and  delay  and  constructing  the  cancel  filter  in 
practice  will  be  discussed  later.  This  concept  is  verified  using  RF  signal  generator(SG) 
Agilent  MXG  N5182A  and  RF  spectrum  analyzer(Sa)  Agilent  MXA  N9020A.  We  connect 
SG  and  SA  using  two  cable  with  different  length  and  RF  combiner,  splitter  and  attenuator. 
This  block  provides  us  a  two  path  channel  with  similar  attenuation  and  different  delay.  In 
practical  design,  without  the  knowledge  of  the  delay  and  the  attenuation,  we  can  connect  a 
sequence  tunable  step  attenuator,  each  with  a  different  fixed  delay  for  the  cancel  block.  With 
small  enough  step  and  difference  between  delay,  the  ideal  cancel  channel  can  be  arbitrarily 
accurate  approximated. 

11.1.2  Broadband  Signal  Cancellation 

Broadband  can  be  view  as  the  combination  of  plenty  of  single  tone  signal.  Each 
attenuation  path  will  result  in  a  different  phase  and  similar  attenuation  for  each  tones.  For 
example,  x(t)  with  two  tones  at  f\  and  /2  are  transmitted,  and  received  as 

V(t )  =  Th  (*  -  r)Ah  exp (JOfi)  +  Xf2(t  -  r)Ah  exp (jOh)  (11.5) 
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In  this  case,  we  need  at  least  two  independent  cancel  path  to  cancel  the  self-interference. 


Let  the  received  signal  of  the  cancel  path  be: 

ya(t)  =  xfl(t  -  T)Aaex.p(jOafl)  +  xaf2(t  -  T)Aaexp(j9af2)  (11.6) 

Vb(t)  =  xfl(t  -  r))Ab  exp(j0bfl)  +  xbf2(t  -  T)Abexp(jdbf2)  (11.7) 

Adjust  A a ,  .4 b .  Qaf i,  0af2  •}  @bf\  and  0bf2 

-Afl  exp (jdfl)  =  Aaexp(jeafl)  +  Abexp(j6bfl )  (11.8) 

and 

-Ah  exp (jOf2)  =  Aa  exp (j0af2)  +  Ab  exp {jObf2)  (11.9) 

Then  the  interference  from  the  two-tone  signal  can  be  canceled.  Multiple-tone  or  broadband 
signal  interference  can  be  canceled  similarly.  It  is  also  straightforward  to  achieve  similar 
result  for  multiple  path  case.  For  each  path,  we  use  a  set  of  cancel  channel  to  cancel  the 
broadband  signals.  The  conclusion  is  that,  with  enough  cancel  path,  we  can  construct  a 
inverted  broadband  channel  to  cancel  the  self-interference. 

11.2  Evaluation  Criteria 

The  adaptive  filter  is  could  be  composed  with  several  tap,  each  is  a  tunable  step- 
attenuator  connected  with  fixed  delay.  Or  it  could  contains  other  component  such  as  phase 
splitter,  circulator  and/or  RF  switches.  The  adaptive  filter  changes  as  the  value  of  the 
attenuators.  And  currently  there  is  no  tunable  delay  available.  There  are  many  other 
possible  ways  to  construct  the  adaptive  filter.  Before  discussing  the  variety  of  the  structures, 
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we  discuss  the  evaluation  criteria  of  the  adaptive  filterG*  first.  Since  the  resolution  of  the 


step  attenuator,  or  other  tunable  component,  are  limited,  the  number  of  possible  adaptive 
filter  G  is  limited.  We  call  each  configuration  of  adaptive  filter  G  a  channel  candidate.  To 
compensate  the  actual  interference  channel,  which  is  random  and  time- variant,  we  are  favor 
in  the  structure  that  provide  more  channel  candidate  with  diversity.  For  example,  we  put  4 
step  digital-attenuator  in  series  as  shown  on  the  left  side  of  figure  11.4.  Let  each  attenuator 
have  0  to  31.5dB  with  step  size  0.5dB  and  assume  the  phase  associated  with  the  attenuation 
is  linear  as  shown  in  Figure  11.3. 
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Figure  11.3:  Relay  phase  vs  Attenuation 

In  other  words,  each  attenuator  has  26  possible  values.  The  whole  structure  has 
totally  224  possible  settings.  However,  a  lot  of  these  setting  results  in  same  attenuation. 
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The  structure  can  only  be  OdB  to  126dB,  28  possible  values,  much  smaller  than  the  possible 
settings.  If  we  connect  as  on  the  right  side  of  figure  11.4,  it  would  have  more  possible  values. 


Figure  11.4:  Two  different  taps  composed  by  four  step  attenuator 


Even  though  both  tap  have  same  number  of  the  relay  configurations,  the  combined 
candidates  shows  a  different  distribution  as  shown  in  Figure  11.5. 
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Figure  11.5:  Distribution  of  different  configuration  of  attenuators. 

In  the  figure,  the  black  circle  shows  the  candidates  from  series  connected  attenuator 
while  the  red  cross  shows  the  candidates  from  the  parallel  connected  attenuator.  Besides 
the  number  of  the  channel  candidates,  other  features  such  as  cost,  additional  modification 
for  upgrading  the  current  radio  is  also  important  evaluation  criteria,  but  that  is  beyond  the 
scope  of  this  dissertation. 

11.3  Adaptive  Filter  G  with  Complex  Taps 

One  of  the  adaptive  filter  proposed  by  us  in  [63]  is  to  construct  complex  taps  as 
shown  in  Figure  11.6.  As  we  know  that  the  equivalent  baseband  channel  is  a  complex  chan- 
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nel.  If  we  can  construct  a  complex  baseband  channel,  then  we  can  create  the  corresponding 
RF  channel  accordingly. 


Figure  11.6:  Complex  tap 
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Figure  11.7:  The  baseband  equivalent  Complex  tap 

However,  the  attenuator  we  have  only  provides  certain  attenuation  and  a  hxed 
delay  associated  with  it.  Assume  the  phase  is  linearly  associated  with  the  attenuation  (in 
dB),  Figure  11.7  shows  the  equivalent  complex  vector  contributed  by  the  step  attenuator  in 
Figure  11.6  with  smallest  attenuation.  The  x  axis  is  the  response  projected  to  I  channel  and 
y  axis  is  the  response  projected  to  Q  channel.  Larger  the  amplitude,  smaller  the  attenuation. 
If  the  attenuator  has  infinite  small  resolution,  then  the  achievable  area  would  be  the  convex 
combination  of  vector  gn<i ,  gn<2,  gn;s  and  gU) 4.  The  convex  combination  is  defined  as: 

=  CHSVi,!  +  Oi2gn,2  +  Ofi9n,Z  +  C*4Sn,4  (11.10) 
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where  a*  6  [0, 1],  Vi.  The  reason  that  oti  cannot  be  negative  is  because  the  attenuation  must 
be  non-negative,  otherwise  it  invert  the  signal.  Therefore  we  can  acquire  arbitrary  phase 


with  complex  tap.  By  connecting  the  complex  taps  in  series  with  small  enough  interval,  we 
can  achieve  arbitrary  bandwidth  of  the  adaptive  filter  G.  Comparing  to  other  structure, 
such  as  uniform  distributed  delay  in  [41],  complex  taps  performances  better.  Details  of  the 
comparison  can  be  founded  in  [63]  .  The  off-the-shell  step  attenuator  from  Mini- Circuits 
ZX76-31R5-SP-S+  has  a  non-linear  phase  associated  with  the  attenuation.  The  test  results 
of  ZX76-31R5-SP-S+  phase  vs  attenuation  is  shown  in  Figure  11.8 
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Figure  11.8:  Ideal  pase  vs  attenuation  and  the  measured  one 
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This  non-linear  phase  will  affect  the  distribution  of  the  candidates  provided  by  the 


complex  taps.  A  simulation  are  shown  in 
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Figure  11.9:  Candidates  of  complex  tap  using  ideal  and  non-ideal  attenuator 


The  red  dots  are  the  candidates  from  the  ideal  attenuator  complex  tap  and  the 
blue  circles  are  from  the  non-ideal  attenuator. 
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11.4  Experiments 


We  have  built  a  single  complex  tap  to  verify  the  aforementioned  idea.  As  shown 
in  Figure  11.10,  the  complex  tap  is  made  by  90°  splitter,  step  attenuator,  power  combiner 
and  controlled  by  USB  I/Q  interface  provided  by  Mini  Circuits. 


Figure  11.10:  Single  complex  tap.  The  component  is  connected  as  shown  in  Figure  11.6. 
Attenuator  is  ZX76-31R5-SP-S+  from  mini  circuit.  All  attenuator  is  controlled  by  the  USB 
I/Q  interface  with  shared  data  and  clock.  We  tested  40  MHz  broadband  signal  at  2.4  GHz. 
Details  of  the  component  and  USB  I/Q  interface  are  listed  in  Appendix  A. 


We  use  this  simple  complex  tap  to  cancel  the  single  path  self-interference.  With 
the  limits  of  the  step  size  of  the  attenuator  and  non-real-time  interface,  we  achieved  30 
dB  cancellation.  Another  solution  we  are  currently  working  on  is  a  complex  multi-tap 
cancellation  board  with  real  time  control.  We  integrated  several  complex  taps  on  a  single 
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PCB  board  with  the  control  interface  for  FPGA,  up  to  32  complex  tap  can  be  configured  in 


real  time  for  self-interference  cancellation.  The  PCB  of  the  complex  multi-tap  cancellation 


board  is  shown  in  Figure  11.11.  Details  of  the  component  are  listed  in  Appendix  A. 


Figure  11.11:  PCB  of  the  complex  multi-tap  cancellation  board.  Attenuator  is  PE43703 
from  Peregrine  Semiconductor.  Details  of  the  component  are  listed  in  Appendix  A. 


11.5  Adaptive  Filter  with  Active  Component 

One  way  to  build  this  filter  G  is  using  another  Rx  sample  the  transmitting  noise 
with  an  attenuator.  Then  the  received  self-interference  is  processed  with  DSP  technique 
and  send  out  with  another  Tx  to  combine  before  in  Figure  11.1.  In  this  way,  the 
sampling  Rx  the  self-interference  could  use  all  the  dynamic  range  for  the  self-interference. 
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The  combination  before  would  cancel  the  the  self-interference  up  to  the  dynamic  range 


of  the  sampling  Rx  and  the  Rx  can  use  all  its  dynamic  range  for  the  remote  desired  signal 
and  the  residual  of  the  self-interference.  This  is  more  like  concatenated  the  dynamic  range 
of  two  Rx.  And  multiple  layer  of  this  Rx  can  be  used  if  necessary.  The  advantage  of  this 
method  are: 


1.  G  can  be  easily  manipulated  in  complex  baseband  form. 

2.  Multiple  Rx  can  be  used  together  to  achieve  totally  high  dynamic  range. 

3.  The  algorithm  is  well  understood  as  success  interference  cancellation. 

However,  this  method  is  also  suffered  from  the  following  disadvantage: 

1.  The  delay  of  path  contain  adaptive  filter  G  is  extreme  short  since  the  self-interference 
channel  has  a  short  path. 

2.  The  additional  Rx  and  Tx  will  introduce  noise.  Though  this  noise  is  less  significant 
than  the  original  transmitting  noise,  it  will  degrade  the  performance. 

3.  Additional  Rx  and  Tx  increase  the  cost  and  heat  dissipation. 
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Chapter  12 


Tuning  Algorithm  for  Adaptive 
Filter 


With  the  previous  analysis,  we  showed  that  the  adaptive  filter  architecture  is 
capable  of  creating  a  opposite  channel.  The  following  discussion  is  about  how  to  find  the 
optimal  solution  of  adaptive  filter  G.  Here  is  the  system  model  in  terms  of  G,  represented 
in  the  complex  baseband  form.  During  the  training,  we  assume  no  desired  remote  signal, 
the  received  y[n\  in  Figure  11.1  is  purely  caused  by  the  self-interference  x[n].  We  model 
y[n }  as  the  linear  function  of  x[n]  and  w[n ]  and  affine  function  of  impulse  response  of  G[n ] 

12.1  Complex  Linear  System 

Let’s  denote  the  impulse  response  of  filter  G  as  g[0],  g[  1],  •  •  • ,  g[L\.  With  the  linear 
system  assumption,  which  is  reasonable,  for  any  given  training  pulse  x[n],  n  =  0,  1,  2,  •  •  • , 


N  —  1  and  a  proper  guard  interval,  we  have 


y  i  =  (F  +  w.j)g  +  f  +  w  i 


(12.1) 


where  y,;  =  [y[0],  y[  1],  •  •  •  ,  y[N  —  1]}T  is  the  vector  of  the  corresponding  outputs  in  the  ith 
realization,  g  =  [<7 [0] ,  g[ l],---  ,  g[L]]T ,  F  and  f  are  unknown  linear  function  of  x[n],  and 
W  and  w,  are  unknown  linear  function  of  noise  w[n].The  same  training  sequence  x[n]  is 
used  repeatedly.  Denote  the  energy  of  y*  by  ||y||2,  i  £  [1,  M].  For  large  M,  the  average 
energy  of  the  self-interference  is  given  by 


M 


e=  M  X!  llyiH2  =  +  Rw)g  +  2i?e{(rF  +  rw)g}  +  rf  +  rv 


(12.2) 


i= 1 


where  RF  =  F^F,  Kw  =  rf  =  fHF,  rf  =  f  w«W,  rf  =  fHf ,  and 

rw  =  yj  wfwi-  Equivalently,  we  can  lump  some  notations  together,  the  previous 
equation  becomes: 

e  =  g^Ag  +  Rejg^b}  +  c  (12.3) 

where  A  =  A77,  b  and  c  are  unknowns  to  be  determined  during  training  before  determine 
the  optimal  g.  Define 


A  = 


A  r  Aj 


A  j  A,. 

T  „T1 T 


g  =  [gr  >  Si  ] 

b  =  [bf  b 


(12.4) 

(12.5) 

(12.6) 


where  Ar  =  Re{A}  =  AjT,  Aj  =  Im{ A}  =  Af,  gr  =  i?e{g},  gj  =  Im{ g},  br  =  i?e{b} 
and  bj  =  Im{b}.  Equ.  (12.3)  can  be  rewrite  as: 


e  =  g7  Ag  +  g7  b  +  c 


(12.7) 
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Furthermore,  we  can  write 


e  =  g1  <g>  g1  vec(  A)  +  g1  b  +  c 


=  [gf,  S  I]  ®  [g  f,  g  f] 


uec 


uec 


V  A'  / 

/-A  ^ 


V  A-  J 


+  grb  +  c 


Sr 


T  ^  r  „T  „T 


,  g,  Juec 


/  A  f 

(  A  \ 

Ar 

+  gf  ®  [gf ,  gf ]vec 

-A./ 

A.  J 

A.,  J 

+  grb  +  c 


=  {gf  ®  [grk  s!  \  +  g I  ®  [gf ,  -gfj} 


uec 


/A  ^ 


vA-y 


+  g1  b  +  c 


(12.8) 


where  <8>  is  the  outer  product,  and  vec(-)  is  the  vectorization  function  Let  P  be  a  permutation 
function  that 


vec 


1  A  i 


\A‘J 


Then 


vec{  Ar) 
y  vec(Ai) 

( 


(12.9) 


=  {gf®[gf,  gf]  +  gf®fef,  -gf]}P 


vec(  Ar) 


\ 


+  g2  b  +  c 


y  vec(Ai)  J 

( 

=  { [gf  ®  gf ,  gf  ®  gf  ]  +  [gf  ®  g I,  ~g f  ®  gf  ] } 


\ 


=  {[gf  ®  gf  +  gf  ®  gf ,  gf  ®  gf  -  -gf  ®  gf]} 


vec(  A 

y  uec(Aj)  y 

/  ,A 

vec{Ar) 


+  gTb  +  c 


y  vec(Ai)  J 


+  gTb  +  c  (12.10) 
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Since  Ar  =  AJ  and  Ai  =  —Ai,  only  lower  triangular,  including  the  diagonal,  element  of  Ar 
are  independent  and  only  the  strictly  lower  triangle,  excluding,  the  diagonal,  elements  of  A* 
are  independent.  We  now  define  the  selection  matrices  SrjJ+i  and  that  Sr)L+iuec(Ar) 

and  Sj  L+\vec(Ai)  only  contains  the  independent  element.  Then 

e  =  uTv  (12.11) 


where 


v  =  [Sr.)L+iwec(Ar)SiiL+i'yec(Ai)bc]  (12.12) 

uT  =  [uf  4  gT  1]  (12.13) 

where  uf  =  (gj  0  gj  +  g[  0  gf  )S£  L+1DL+U  =  2(gJ  0  g f  -  gf  0  gj )Sit  L+1  and  for 
example 


D4  =  diag[  1  2  2  2|  1  2  2  |  1  2  |  1]  (12.14) 

Assume  we  use  a  unique  g [k],  k  £  [1,  K]  for  every  M  realizations.  Then  we  get  a  corre¬ 
sponding  e[k]  that 

e  =  Gv  (12.15) 


where  e  =  [e[l],  •  •  •  ,  e[K]]  and 


G 


u[l]r 


^  -\-2L-\-3) 


(12.16) 


[  u[A]T  J 

and  u[k]  is  defined  in  Equ.  12.13.  If  G  has  a  full-column-rank,  the  least  square  solution  of 
v  is  given  by 


v  =  (GTG)-1GTe 


(12.17) 
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Vector  v  has  all  the  information  of  estimates  of  A,  b  and  c.  According  to  Equ.  (12.7),  with 
the  estimates  A,  b  and  c,  the  optimal  adaptive  filter  g  is  given  by 


xb 


(12.18) 


12.2  Real  Linear  System 


If  there  is  an  I/Q  imbalance  phenomena  in  the  circuit,  the  complex  linear  model 
fails.  We  can  use  a  real  linear  model  to  replace  the  complex  one.  I/Q  imbalance  phenomena 
is  discussed  later.  The  real  linear  model  is  more  relax  problem  than  the  complex  one,  since 
the  in  phase  and  quadrature  signal  are  decoupled.  But  realize  an  adaptive  filter  for  real 
linear  model  is  much  more  difficult  and  therefore  less  practical.  Luckily  the  I/Q  imbalance 
is  not  severe  practically  and  complex  model  can  be  applied  generally.  The  linear  model  can 
be  written  as: 

e  =  g1  Ag  +  gb  +  c  (12.19) 

were  the  real  matrix  A  only  has  the  symmetric  property  A1  =  A.  Similarly 


where 


e 


=  U  V 


(12.20) 


v  = 


S2L+2^ec(A)bc 


V  =  [(gr  ®  gT)S|L+2D2L+2  |  gr|  1 


VTi 


(12.21) 

(12.22) 


92 


12.3  Training  vector  g[k\  Design 


One  critical  condition  for  finding  the  optimal  adaptive  filter  G  is  to  form  the 
aforementioned  matrix  G  with  full-column-rank.  With  more  realization  of  training,  G  will 
be  full-column-rank.  However,  using  the  minimum  number  of  training  sequence  to  find  a 
full-column-rank  G  with  a  small  condition  number  worth  discussion.  One  way  to  construct 
G  is  to  make  it  as  orthogonal  as  possible.  In  [52] ,  a  choice  of  training  vector  is  introduced, 
which  results  in  a  sparse  matrix  G.  The  sparseness  is  also  useful  for  reduced  computation. 


12.3.1  Real  System 

For  real  system,  A  is  a  symmetric  matrix.  Let  g  have  the  dimension  m  x  1. 
Therefore  the  number  of  unknown  real  parameters  in  the  systems  is  Nm  =  m(m  +  l)/2  + 
m  +  1.  Denote  the  corresponding  G  as  Gm  and  g[n]  =  [<7i[n],  g^n],  ,  ■■■  ,  gm[n]]T  with 
n  G  [1,  N].  If  m=2,  N2  =  6  and 


31  [l]2 

2</i[l]<fc[l] 

32[1]2 

31  [1] 

32  [1] 

1 

91  [2]2 

2<?i[%2[2] 

32  [2] 2 

31  [2] 

32  [2] 

1 

9i  [3]2 

2<7i  [3]  <72  [3] 

32  [3] 2 

3i  [3] 

32  [3] 

1 

91  [4]2 

25i  [4]  92  [4] 

32  [4] 2 

3i  [4] 

32  [4] 

1 

91  [5]2 

25i[5]ff2[5] 

32  [5] 2 

3i  [5] 

32  [5] 

1 

91  [6]2 

23i  [6]  52  [6] 

32  [6] 2 

31  [6] 

32  [6] 

1 

To  construct  a  nonsingular  G2,  one  of  the  choice  could  be 
1-  g[l]  =  [0,  0] 
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2-  g[2]  =  [1,  0] 


3.  g[3]  =  [-1,  0] 

4.  g[4]  =  [0,  1] 

5.  g[5]  =  [0,  -1] 

6-  g[6]  =  [1,  1] 


The  corresponding  G2  is 


1 

0 

0 

0 

0 

0 

1 

1 

0 

0 

1 

0 

1 

1 

0 

0 

-1 

0 

1 

0 

0 

1 

0 

1 

1 

0 

0 

1 

0 

-1 

1 

1 

2 

1 

1 

1 

1 

(12.24) 


It  is  easy  to  verify  that  G2  has  full  rank  and  the  det( G2)  =  1.  For  any  given  rri,  we  can 
show  that  using  the  following  procedure  will  make  a  full  rank  Gm : 


1.  gi[  1]  =  0,  for  i  €  [1,  ,  •  •  •  ,  m\ 


2.  gi[2i\  =  1,  for  *  €  [1,  ■  ■  ,  m\ 


3.  gi[2i  +  1]  =  -1,  for  i  €  [1,  ,  •  •  •  ,  m\ 

4.  gik[k  +  m  +  1]  =  gjk[k  +  m  +  1]=1,  k  G  [1,  •  •  •  ,  Nm  —  m  —  1],  where  (*&,  jk)  is  the  /cth 
pair  of  elements  out  of  {1,  , . . . ,  m}  and  ik  <  jk 
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5.  g[n\  =  0  otherwise. 


To  prove  this  procedure  will  generate  the  full  rank  Gm,  assume  -/Vm_i  x  IVm_i  matrix 
generated  using  the  aforementioned  procedure  has  full  rank  7Vm_i.  Given  Nm  = 
m(rn  +  l)/2  +  m  +  1,  Nm  =  lVm_i  +  m  +  1,  Gm  has  m  +  1  additional  columns  and  rows. 
One  can  then  verify  that  there  are  a  row  permutation  matrix  Pi,  a  column  permutation 
matrix  P2  and  an  elimination-by-row  matrix  T m  such  that 

Gm_i 

P1TmGmP2  =  j  (12.25) 

21  m—  1 

where  the  two  rows  associated  with  the  component 

1  1 

J  =  (12.26) 

1  -1 

in  the  above  are  due  to  the  two  entries  in  Gm:  g(2 m)  =  [0,  •  •  •  ,0, 1]T  and  g(2m  +  1)  = 
[0,  •  •  •  ,0,  —  1]T.  The  m  —  1  rows  associated  with  the  component  2Im_i  in  the  above  are 
due  to  the  m  —  1  entries  in  Gm :  g (Nm  —  m  +  1  +  i)  =  [ef ,  1]T,  i  =  1,  •  •  ■  ,  m  —  1,  where 
e*  is  the  ( m  —  1)  x  1  vector  of  all  zeros  except  the  one  value  at  its  ith  position.  Hence, 
det(Gm)  =  —  2m  det(Gm_i).  Given  det(G2)  =  23,  we  have  det(Gm)  =  (— l)m2  2  . 

12.3.2  Complex  System 

For  the  complex  system,  the  matrix  G  has  an  additional  structure.  If  we  let 
g  have  the  dimension  k  x  1  where  k  =  2m,  then  there  are  total  M*.  =  (fc/2)2  +  k  +  1 
unknown  real  parameters  in  the  system.  Note  that  g (n)  =  [g^(?r),g J (n)]T  where  g r(n)  = 
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\gr,\(n),  ■  ■  ■  ,gr,m(n)]T  and  g *(n)  =  [gi,i(n),  ■  ■  ■  ,gi)m(n)]T.  Also  note  that  Mk  =  Nm  + 
(m  i )m.  m  yye  will  c]erio^-e  ^jie  corresponding  G  by  G(ky  We  will  propose  a  choice  of 
g(n)  for  n  =  1,  ■  ■  ■  ,  Mk  to  construct  a  nonsingular  Mk  x  Mk  matrix  G(ky 
There  is  a  permutation  matrix  such  that  as  follows: 


G(fc)P  (fc) 


Gi,i 

Gi,2 

G2,l 

G2,2 

(12.27) 


where  the  first  block  column  only  depends  on  g r(n)  and  the  second  block  column  depends 
on  both  g r(n)  and  g i(n).  Furthermore,  Gu  G  RNm*Nm  jiag  t}ie  exactly  same  structure  as 
Grn  in  the  real  system  with  g(n)  replaced  by  g r(n). 

For  1  <  n  <  Nm ,  we  propose  to  choose  g,;(n)  =  0  but  at  the  same  time  choose 
g r(n)  in  the  same  way  as  for  the  real  system.  Then,  we  know  that  Ggi  is  nonsingular  and 
Gg2  =  0.  Furthermore,  for  any  G2,2;  there  is  an  elimination-by-row  matrix  such  that 


T(fc)G(fc)P(fc)  — 


(12.28) 


Gi.i  0 
0  G2,2 

It  is  now  sufficient  to  choose  Mk  —  Nm  =  —  2^m  +  ?n  additional  g(n)  to  make  G2,2  full 
rank. 


For  m  =  3,  the  structure  of  the  nth  row  of  G2,2  is 


(G2,2)n,:  = 


28gr,i(l,2) 


2Sgr,i(l,3)  2^(2,  3) 


Si 


(12.29) 


where  Sgrji(k,l )  =  gr(k)gi(l )  -  gi(k)gr(l)  and  gf  =  [&(!),  gi(2),  £fe(3)j.  If  we  choose 


1.  g(IV3  +  1)  =  [0, 0, 0, 1,  0,  0]T 

2.  g(As  +  2)  =  [0, 0, 0,  0, 1,  0]T 
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3.  g(iV3  +  3)  =  [0, 0,  0,  0,  0, 1]T 

4.  g(IV3  +  4)  =  [1, 0,  0,  0, 1,  0]T 

5.  g(IV3  +  5)  =  [1, 0,  0,  0,  0, 1]T 

6.  g(iV3  +  6)  =  [0, 1,  0,  0,  0, 1]T 

then  we  have  (up  to  a  column  permutation) 

G2,2  =  diag  (I3,  2I3)  (12.30) 

For  any  given  rn.  we  choose  g(n)  for  Nm  +  1  <  n  <  Mk  as  follows: 

1-  gi,n(n  +  Nrn)  =  1  for  n  =  1,  •  •  •  ,  rn 

2.  gr,nk(Nm  +  m  +  k)  =  gi,mk( Nm  +  m  +  k)  =  1  for  1  <  k  <  (m~Vm  where  ( nk,mk )  is 
the  /cth  pair  of  elements  from  { 1 ,  •  •  •  ,  m}  satisfying  n &  <  rn^ 

3.  gr,j{n)  =  0  and  gi,j{n)  =  0  otherwise. 

One  can  verify  that  up  to  a  column  permutation  we  have 

Gr2,2  —  dzag  ;  2I(m— i)m^  (12.31) 

(m  —  l)m 

which  implies  that  det(G2,2)  =  2  2  . 

Therefore,  det(G(j.))  =  det(Gi,i)  det(G2,2)  =  det(Gm)  det(G2,2)  =  2m“. 

12.4  IQ  Imbalance  Issue 

IQ  imbalance  is  a  phenomenon  that  a  different  phase  and/or  amplitude  is  intro¬ 
duced  to  in  phase  and  quadrature  component.  We  conduct  an  analysis  and  tested  the  results 
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using  hardware  to  show  that  IQ  imbalance  could  affect  self-interference  cancellation  and  it 


also  can  be  canceled  properly  using  proper  designed  filterer. 


12.4.1  Model  and  Procedure 

Gaussian  pulse  is  used  during  this  experiment  for  its  dramatical  contrast  between 
the  passband  and  the  stopband.  Figure  12.1  shows  the  time  domain  and  frequency  domain 
representation  of  the  Gaussian  pulse  adopted  in  this  experiment. 


Baseband  Signal 


Bandwidth  in  MHz  with  sample  rate  125MHz 


Figure  12.1:  Time  domain  and  frequency  domain  representation  of  Gaussian  pulse  used  in 
this  experiment 


12.4.2  Tranining  Phase 

Before  fullduplex  transmitting  starts,  a  training  phase  is  needed  to  estimate  the 
channel  and  preparing  the  cancel  waveform.  During  training  phase,  waveform  transmitted 
by  Tx,  Cx  and  the  corresponding  received  waveform  at  Rx  is  listed  in  Table  12.1  below. 


98 


Four  consecutive  time  slots  are  used  for  transmitting  pilot  signal  during  the  training  phase. 
W(-W)  indicates  that  the  Tx  or  Cx  transmits  the  Gaussian  pulse  w[n\,  n  E  [1,  N]  in 
Figure  12.1(or  the  negtive  Gaussian  pulse).  RxR±  denotes  the  received  in-phase  signal 
at  slot  1  while  RxR  indicates  the  received  quadrature  at  slot  3.  Index  n  is  omitted  for 
convenience.  Therefore,  the  channel  responses  can  be  calculated  as  in  Table  12.2. 

The  prefilterred  signal  generated  with  real  model  for  fullduplex  can  be  described  as  the 
following  equation: 


Xar 

h\2  *  h 23  —  h-22  *  /ll3 

h\2  *  h-24  —  h’2 2  *  h\4 

Xai 

h-21  *  /i-13  ~  ^11  *  ^23 

/l2l  *  h\4  —  /in  *  /i24 

Xbr 

/l22  *  h\\  —  h\2  *  h,21 

0 

Xbi 

0 

/l22  *  /ill  —  hl2  *  /l21 

S'p 


Si 


(12.32) 


Slot  1 

Slot  2 

Slot  3 

Slot  4 

TxR 

W 

0 

W 

0 

Txl 

0 

W 

0 

w 

CxR 

w 

0 

-W 

0 

Cxi 

0 

w 

0 

-W 

RxR 

RxR± 

RxR2 

RxR:i 

RxRi 

Rxl 

Rxl\ 

Rxl2 

Rx  /;> 

RxR 

Table  12.1:  Transmitting  scheduled  for  training  phase 
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2hu  =  RxR\  +  RxRs  2/121  =  Rxl±  +  RxR 

2/112  =  RxR2  +  Rx  /?4  2/122  =  Rxl2  +  -Rx/4 

2/113  =  RxR]  —  RxRz  2/123  =  itc/i  —  RxR 

2/114  =  RxR2  —  RxR±  2/124  =  RxJ-2  —  RxR 


Table  12.2:  Prefiltered  signal  for  Fullduplex  Waveform 


In  other  hand,  prefilterred  signal  generated  with  complex  model  for  fullduplex  can  be  de¬ 
scribed  as  the  following  equation: 


Xar 

hi3  +  hi  4 

—  (h'23  +  ^24) 

%ai 

h-23  +  ^24 

hl3  +  /ll4 

Xbr 

—  (/in  +  h  12) 

/l21  +  /l22 

%bi 

—  (/121  +  /122) 

—  (/ill  +  hu) 

12.4.3  Experiment  and  Result 


Sy 


Si 


(12.33) 


To  compare  the  cancel  effect,  xcn  =  [ xar  xai  0  0]T  is  transmitted  to  show  the 
strength  of  the  uncanceled  self-interference  for  the  complex  model.  The  corresponding 
real  model  signal  is  xrn  =  [ xar  xai  0  0]T .  The  received  self  interference  waveform  by 
transmitting  ay,  ay,  xcn  and  xrn  are  denoted  as  ry,  ry,  Vm  and  fvT,  respectively.  We  can 
also  apply  baseband  cancellation  filter  to  further  reduce  the  interference  residual.  The 
baseband  cancellation  filter  problem  for  complex  method  can  be  described  as: 


nun.  \\rc-Wcbc\\l  (12.34) 

bc 
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where  Wc  is 


wc[  1]  0  0 

Wc[  1]  wc[  2]  0 

wc[l]  wc[2 }  iuc[3] 


0 


0 


0  0 
0  0 


(12.35) 


0  0  0  wc[N]  wc[N  —  1] 

0  0  0  0  wc[N } 

bc  is  the  adaptive  complex  filter  for  baseband  cancellation  and  wc[n]  is  the  nth  complex 
sample  of  the  transmitted  waveform.  The  optimal  bc  can  be  expressed  as: 


bc  =  (wcHwc)~]wcH^ 


(12.36) 


Traditional  convolution  complex  sequence  can  be  expressed  as: 

I< 

c[n ]  =  a[n }  *  b[n]  =  a  [A;]  x  b[n  —  k ] 
k=  1 

I\ 

=  ^2{[Re(a[k})  x  Re(b[n  —  k])  —  Im(a[k})  x  Im(b[n  —  &])] 
k=  1 

+j  [Re(a[k])  x  Im{b[n  —  k])  +  Im(a[k ])  x  Re(b[n  —  A;])]} 
It  can  also  be  expressed  in  matrix  form 


c[l] 

o[l]  0  0  0  0 

m 

c[2] 

a  [2]  a[l]  0  0  0 

b[  2] 

=  Ab  = 

°[K\ 

CS 

o 

o 

o 

o 

_ 1 

b[Kb] 

Re(c)  =  Re{A)Re(b)  —  Im(A)Im(b ) 


(12.37) 


(12.38) 


(12.39) 
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(12.40) 


Im(c )  =  Re(A)Im(b )  +  Im(A)Re(b ) 

where  /\a  and  7Q,  are  the  length  of  a[n]  and  6[n]  and  K  =  Ka  +  K),  —  1.  Use  d[n]  as  the 
input  and  c[n]  as  output,  the  process  of  real  method  complex  filter  b  can  be  described  as 

i?e(c[l])  Jm(c[l]) 

Re(c[  2])  7m(c[2]) 

=  Ab  (12.41) 

i?e(c[77])  /m(c[iv]) 

where 

i?e(a[l])  Im(a[  1])  0  0  0 

A  i?e(a[2])  /m(a[2])  7?e(a[l])  Jm(a[l])  0 

A  - 

0  0  0  0  i?e(a[ifa])  Im(a[il'a]) 

bn[l]  ^>12  [1] 

&2l[l]  &22 [1] 

frn[2]  bi2[2] 

621  [2]  ^22 [2]  (12.43) 

611  [7Q,]  612  [^6] 

&2l[-K&]  ^22  [Kb\ 

Here  b  has  twice  number  as  b.  Consider  the  optimization  problem  below: 

min.  ||r  —  Ab\\\  (12.44) 

b 


0 

0 

(12.42) 
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We  can  manipulate  the  expression  and  use  traditional  least  square  results  to  solve  this 
problem.  Let 


r  =  [rreT  rimT]T  =  [Re(r[  1]),  Re(r[  2]),  Re(r[Kr]),  •••  ,  Im{r[  1]),  Im{r[  2]),  •••  ,  Im(r[Kr])]T 

(12.45) 

b=  [bre  bim  Y  =  [6n[l],  621(1],  611  [2],  621(2],  •••  ,  6n[A&],  621^],  ••• 

6i2[1],  622(1],  6i2[2],  632(2],  •••  ,  612 [Kb\ ,  b22[Kb}]T  (12.46) 


and 


A 


i  6 
6  A 


(12.47) 


where  0  is  the  zero  matrix  same  size  as  A.  Now  the  optimization  problem  can  be  written 
as: 


min.  ||r  — A6H2  (12.48) 

b 

Further  more  it  can  ge  written  as  two  independent  optimization  with  smaller  matrix  size: 


min.  | \rre  —  Abre\\\  (12.49) 

bre 

and 

nun.  \\r^n  -  Abirn\\l  (12.50) 

him 

Experiments  under  two  environment  are  conducted.  The  first  experiment  shows  the  cancel¬ 
lation  results  of  RF,  RF  combined  with  traditional  baseband  cancel  filter  and  RF  combined 
with  the  real  method  complex  filter.  The  MXG  is  set  up  without  IQ  adjustment.  The 
second  experiment  shows  the  similar  results.  However,  the  IQ  adjustment  is  set  as  ldB  in 
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IQ  amplitude  and  10  degree  in  phase  for  Tx  and  -ldB  in  IQ  amplitude  and  -10  degree  for 
Cx,  which  is  the  largest  IQ  adjustment  that  allowed  on  the  system. 

The  result  of  the  first  experiment  is  shown  in  Figure  12.2  and  second  shown  in 
Figure  12.3.  A  comparison  over  of  the  RF  cancellation  over  two  experimental  environment 
is  also  shown  in  Figure  12.4 


5  10  15  20  25  30  35 

Without  IQ  adjutment  Cancellation  in  dB 


Figure  12.2:  Cancellation  comparison  without  IQ  adjustment. 
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Without  IQ  adjutment  Cancellation  in  dB 


Figure  12.3:  Cancellation  comparison  with  IQ  adjustment. 


Cancellation  comparison  over  IQ  adj  in  dB 


Figure  12.4:  Cancellation  comparison  with  IQ  adjustment. 
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Chapter  13 


Conclusion 


The  second  part  of  this  dissertation  has  presented  a  time  domain  transmit  beam¬ 
forming  method  and  an  adaptive  filter  method  for  realizing  full-duplex  wireless  communi¬ 
cation.  All  the  technique  are  supported  with  numerical  analysis  and  hardware  experiment. 
The  transmit  beamforming  method  is  verified  using  Agilent  MXA  and  MXG  and  a  WARP 
radio  platform.  45dB  cancellation  on  a  40MHz  signal  is  achieved  using  MXA  and  MXG, 
with  advanced  component  that  is  typically  used  in  laboratory.  27dB  cancellation  on  a 
20MHz  signal  is  achieved  using  the  Warp  platform  and  off-the-shelf  commercial  compo¬ 
nents.  The  cancellation  is  limited  by  the  impairment  of  the  transmitter,  such  as  phase 
noise  and  non-linear  effect.  A  solution  for  breaking  this  limit  is  the  proposed  adaptive  filter 
method,  which  creates  a  cancellation  channel  to  cancel  the  self-interference  channel.  Several 
architectures  of  the  cancellation  channel  are  presented  and  compared.  A  hardware  experi¬ 
ment  is  conducted  for  a  single  tap  case.  Currently  we  are  developing  a  multi-complex-tap 
PCB  board  for  further  investigation.  Other  issues  such  as  IQ  imbalance  are  also  discussed 
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and  simulated. 


To  realized  full-duplex  radio,  we  have  to  pay  special  attention  to  the  impairment 
of  the  component.  People  working  on  full-duplex  radio  cannot  make  many  assumptions 
that  are  only  true  for  the  half-duplex  scenario.  Self-interference  cancellation  demands  a 
system  that  can  accurately  cancel  a  waveform  up  to  lOOdB  or  more.  A  digital  domain 
numerical  approach  is  not  enough  as  shown  earlier.  Certain  modification  of  hardware  is 
required  to  achieve  a  large  amount  of  self-interference  cancellation.  Our  experiment  was 
based  on  components  designed  for  half-duplex  radio.  We  can  expect  a  more  practical  and 
robust  solution  with  advanced  components  designed  for  full-duplex  radio.  This  in  turn  will 
introduce  new  topics  associated  with  algorithm  development,  new  architectural  designs,  and 
even  possibly  revolutionary  changes  in  protocols. 

When  the  technique  to  suppress  self-interference  becomes  mature,  full-duplex  be¬ 
comes  feasible.  Full-duplex  radio  will  not  only  double  the  capacity  of  a  two-way  communi¬ 
cation  with  the  same  spectral  resource,  but  also  change  the  current  wireless  communication 
structure.  Current  radio  protocols,  architectures  and  regulations  are  developed  based  on 
half-duplex.  Self-interference  cancellation  technology  will  bring  fundamental  changes  in 
spectrum  management.  For  example,  in  WiFi  application,  co-channel  broadband  noise  and 
interference  can  be  eliminated  and  multiple  channels  can  be  used  to  simultaneously  boost 
the  speed.  With  full-duplex  radio,  future  generations  of  wireless  communication  will  be 
more  efficient  in  spectral  management  and  more  flexible  in  resource  distribution. 
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Appendix  A 


Detailed  Information  of  the 
Component  used  in  the 
Experiment. 
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Name 

Product  Name 

Website 

Signal  Generator 

Agilent  MXG  N5182 

www.home.agilent.com 

Spectrum  Analyzer 

Agilent  MXA  N9020A 

www.home.agilent.com 

WARP  Board 

Mango  Inc.  Warp  Project 

war  ppr  o  j  ect .  org /tr  ac 

Step  Attenuator(l) 

Mini  Circuit  ZX76-31R5-SP-S+ 

www.minicircuits.com 

Step  Attenuator(2) 

Peregrine  Semiconductor  PE43703 

www.psemi.com/index.php 

Vector  Modulator 

Hittite  HMC631LP3 

www.hittite.com 

USB  I/O  control  Box 

Mini  Circuit  USB  I/O  16D8R 

www.minicircuits.com 

Circulator 

Mini  Circuit  CS-2.500 

www.minicircuits.com 

Power  Combiner /Spliter 

Mini  Circuit  ZAPD-4-S+ 

www.minicircuits.com 

90°  Spliter 

Mini  Circuit  ZX10Q2-27-S+ 

www.minicircuits.com 

4  way  combiner 

Mini  Circuit  ZB4PD-42-S+ 

www.minicircuits.com 

Table  A.l:  List  of  the  key  component. 
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